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Abstract 

In the Edge-Disjoint Paths with Congestion problem (EDPwC), we are given an undirected n- 
vertex graph G, a collection JVi = {(si, ti), . . . , (s/c, tk)} of demand pairs and an integer c. The goal 
is to connect the maximum possible number of the demand pairs by paths, so that the maximum 
edge congestion - the number of paths sharing any edge - is bounded by c. When the maximum 
allowed congestion is c = 1, this is the classical Edge-Disjoint Paths problem (EDP). 

The best current approximation algorithm for EDP achieves an 0(y^)-approximation, by round- 
ing the standard multi-commodity flow relaxation of the problem. This matches the n{y/n) lower 
bound on the integrality gap of this relaxation. We show an 0(poly log fc)-approximation algorithm 
for EDPwC with congestion c = 2, by rounding the same multi-commodity flow relaxation. This 
gives the best possible congestion for a sub-polynomial approximation of EDPwC via this relaxation. 
Our results are also close to optimal in terms of the number of pairs routed, since EDPwC is known 
to be hard to approximate to within a factor of 0, ((log?i)^/'^'^+^)) for any constant congestion c. 
Prior to our work, the best approximation factor for EDPwC with congestion 2 was 0{n^^'^), and 
the best algorithm achieving a polylogarithmic approximation required congestion 14. 
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1 Introduction 



One of the central and most extensively studied graph routing problems is the Edge-Disjoint Paths 
problem (EDP). In this problem, we are given an undirected n-vertex graph G = (y,E), and a 
collection M = {(si, ti), . . . , (s^, t^)} of k source-sink pairs, that we also call demand pairs. The 
goal is to find a collection V of edge-disjoint paths, connecting the maximum possible number of the 
demand pairs. 

Robertson and Seymour |RS90 j have shown that EDP can be solved efficiently, when the number 
k of the demand pairs is bounded by a constant. However, for general values of k, it is NP-hard 
to even decide whether all pairs can be simultaneously routed via edge-disjoint paths |Kar72j . A 
standard approach to designing approximation algorithms for EDP and other routing problems, is 
to first compute a multi-commodity flow relaxation, where instead of connecting the demand pairs 
with paths, we are only required to send the maximum amount of multi-commodity flow between the 
demand pairs, with at most one flow unit sent between every pair. Such a fractional solution can be 
computed efficiently by using the standard multi-commodity flow LP-relaxation, and it can then be 
rounded to obtain an integral solution. Indeed, the best current approximation algorithm for the EDP 
problem, due to Chekuri, Khanna and Shepherd |CKS06b] . achieves an 0(-y/n)-approximation using 
this approach. Unfortunately, a simple example by Garg, Vazirani and Yannakakis |GVY93) (see also 
Section [D] in the Appendix) , shows that the integrality gap of the multi-commodity flow relaxation 
can be as large as 17 (-^/n), thus implying that the algorithm of [CKS06b] is essentially the best possible 
for EDP, when using this approach. This integrality gap appears to be a major barrier to obtaining 
better approximation algorithms for EDP. Indeed, we do not know how to design better approximation 
algorithms even for some seemingly simple special cases of planar graphs, called the brick- wall graphs 
(see section [D] of the Appendix). With the current best hardness of approximation factor standing on 
0(log^/2-'n) for any constant e (unless NP is contained in ZPTIME(nP°iy '°g") ^AZOSl IACG+10| ^. the 
approximability of the EDP problem remains one of the central open problems in the area of routing. 

A natural question is whether we can obtain better approximation algorithms by slightly relaxing the 
disjointness requirement, and allowing the paths to share edges. We say that a set V of paths is an 
a-approximate solution with congestion c, iff the paths in V connect at least OPT/a of the demand 
pairs, while every edge of G appears on at most c paths in V. Here, OPT is the value of the optimal 
solution to EDP, where no congestion is allowed. This relaxation of the EDP problem is called EDP 
with congestion (EDPwC). The EDPwC problem is a natural framework to study the tradeoff between 
the number of pairs routed and the congestion, and it is useful in scenarios where we can afford a 
small congestion on edges. 

The classical randomized rounding technique of Raghavan and Thompson [RT87| gives a constant 
factor approximation for EDPwC, when the congestion c is r2(logn/loglogn). More generally, for 
any congestion value c, factor 0(n^/'^)-approximation algorithms are known for EDPwC |AROH lBSOO[ 
IKS04]. Recently, Andrews [AndlO| has shown a randomized O (poly log n)-approximation algorithm 
with congestion c = 0(poly loglogn), and Chuzhoy jChul2] has shown a randomized 0(poly log /c)- 
approximation algorithm with congestion 14. For the congestion value c = 2, Kawarabayashi and 
Kobayashi |KKllj have recently shown an 0(n^/'')-approximation algorithm, thus improving the best 
previously known 0(-y/n)-approximation for c = 2 |AROH IBSOOj IKS04j . We note that all the above 
mentioned algorithms rely on the standard multi-commodity flow LP relaxation of the problem. It is 
easy to see that the values of the optimal solution of this LP relaxation for the EDP problem, where no 
congestion is allowed, and for the EDPwC problem, where congestion c is allowed, are within a factor c 
from each other. Therefore, the statements of these results remain valid even when the approximation 
factor is computed with respect to the optimal solution to the EDPwC problem. 

In this paper, we show a randomized 0(poly log fc)-approximation algorithm for EDPwC with con- 
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gestion 2. Given an instance {G,M) of the EDP problem, our algorithm w.h.p. routes at least 
0,{OPT / poly log A;) pairs with congestion 2, where OPT is the maximum number of pairs that can be 
routed with no congestion. Our algorithm also achieves an 0(poly log A;)-approximation when com- 
pared with the optimal solution to EDPwC with congestion 2. As all the algorithms for EDP and 
EDPwC mentioned above, our algorithm also performs a rounding of the standard multi-commodity 
flow relaxation for EDP. Therefore, our result shows that when congestion 2 is allowed, the integrality 
gap of this relaxation improves from Q{y/n) to polylogarithmic. Our result is essentially optimal with 
respect to this relaxation, both for the congestion and the number of pairs routed, in the following 
sense. As observed above, if we are interested in obtaining a sub-polynomial approximation for EDP 
via the multi-commodity flow relaxation, then the best congestion we can hope for is 2. On the 
other hand, Andrews et al. jACG"'"10 have shown that the integrality gap of the multi-commodity 
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flow relaxation for EDPwC is f2 ( ( j-. — .2 ) I for any constant congestion c. In particular, the 



integrality gap for congestion 2 is polylogarithmic, though the degree of the logarithm is much lower 
than the degree we obtain in our approximation algorithm. Andrews et al. |ACG"'"10j have also shown 

that for any constant e, for any 1 < c < O (^ lojTiog'kign ) ' there is no O ^ (log n)^^ -approximation 

algorithm for EDPwC with congestion c, unless NP C ZPTIME(nP°^y^°^"). In particular, this gives an 

(log'-^^'^^/^ n) -hardness of approximation for EDPwC with congestion 2. 



While the approximability status of the EDP problem remains open, our results show a fundamental 
difference between routing with congestion 1 and routing with congestion 2 or higher. Suppose we 
are given a solution V to the EDP problem that connects D of the demand pairs with congestion c, 
and we are interested in obtaining another solution with a lower congestion. Our results provide an 
efficient randomized algorithm to find a solution connecting (Z)/(cpoly log A;)) of the demand pairs 
with congestion 2. That is, we can lower the congestion to 2 with only a factor (cpoly log k) loss in the 
number of the demand pairs routecQ However, if we are interested in routing with no congestion, then 
we may have to lose an ^}{^/n)-^actOT in the number of pairs routed. For example, in the integrality gap 
construction of Garg, Vazirani and Yannakakis 5GVY93] , there is a solution that routes k = Q(^/n) 
pairs with congestion 2, but if we require a routing with congestion 1, then at most one pair can be 
routed (see section [D] of the Appendix) . 

We note that better approximation algorithms are known for some special cases of the EDP prob- 
lem. Rao and Zhou |RZ10] have shown that if the value of the global minimum cut in the input 
graph G is il(log^n), then there is an efficient randomized 0(poly logn)-approximation algorithm 
for EDP. Interestingly, this algorithm is also based on the multi-commodity flow relaxation. The 
EDP problem is also known to have polylogarithmic approximation algorithms on bounded-degree 
expander graphs |LR991 [BFU94, IBFSU94[ IKR961 IFriOOj . and constant-factor approximation algo- 
rithms on trees |GVY93l fCMSOTj . grids and grid-like graphs |AR95l IAGLR941 IKT981 IKT95j . Rout- 
ing problems have also been extensively studied on planar graphs. Chekuri, Khanna and Shep- 
herd |GKS04cit IGKSOGa] have shown a poly-logarithmic approximation algorithm for EDPwC with 
congestion 2 and a constant approximation algorithm with congestion 4 on planar graphs. Both 
results have recently been improved by Seguin-Charbonneau and Shepherd [SCSllj . who showed a 
constant factor approximation algorithm with congestion 2. When no congestion is allowed, Klein- 
berg |Kle05] has shown an 0(log^ n)-approximation for Eulerian planar graphs. Kawarabayashi and 
Kobayashi |KK10j have recently improved this result to an 0(logn)-approximation, for both Eulerian 
and 4-connected planar graphs. However, improving the 0(-y/n)- approximation algorithm for EDP on 
general planar graphs still remains elusive. 

^We can view the set V of paths as a fractional solution to the EDP problem instance, where 1/c flow units are sent 
along each path. This gives a fractional solution of value D/c. We can then use our algorithm to route Q (Z)/ (cpoly log k)) 
of the demand pairs with congestion 2, by rounding this fractional solution. 
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A problem closely related to EDP is Congestion Minimization, where the goal is to route all the demand 
pairs, while minimizing the maximum congestion. The classical randomized rounding technique of 
Raghavan and Thompson [RT87| . when applied to the multi-commodity flow relaxation of the prob- 
lem, achieves an 0(logn/loglogn)-approximation. This is the best currently known approximation 
algorithm for the problem. The best current hardness of approximation ratio, due to Andrews and 
Zhang [XZOTj . is n (^xSg^)' ^^^^^ assumption that NP ^ ZPTIME(nP°iy 
Our results. Our main result is summarized in the following theorem. 

Theorem 1 There is an efficient randomized algorithm, that, given a graph G, and a collection M of 
k source- sink pairs, w.h.p. finds a routing 0/ r2(0PT/(poly log /c)) of the pairs in A4 with congestion 
at most 2, where OPT is the maximum number of pairs that can be routed with congestion 2. 

We now provide an overview of our techniques and compare them to [Chul2| . We denote by T the set 
of vertices participating in the demand pairs, and we call them terminals. As in previous work |CKS05[ 
IRZlOl lAndlOl IChul2j . we use the notion of well-linkedness. Given a graph G = (y,E) and a subset 
T of vertices called terminals, we say that G is a-well linked for the terminals, iff for any partition 
{A,B)ofV,\E{A,B)\>a-mm{\AnT\,\BnT\}. Chekuri, Khanna and Shepherd |CKS04bl ICKSOSj 
have shown an efficient algorithm, that, given any EDP instance {G,A4), partitions it into a number 
of sub-instances (Gi, A^i), . . . , (G^ , A4i), such that, on the one hand, each instance Gi is 1-well-linked 
for the set of terminals participating in Mi, and on the other hand, the sum of the values of the 
optimal fractional solutions in all these instances is f](OPT/log^ k). Therefore, it is enough to find a 
polylogarithmic approximation with congestion 2 in each such sub-instance separately. From now on 
we assume that we are given an instance {G,Ai), where G is 1-well-linked for the set T of terminals. 

Chekuri, Khanna and Shepherd |CKS04b| ICKS051 ICKS04a] have suggested the following high-level 
approach to solving EDP instances {G,A4), where G is well-linked for the terminals. They start by 
defining a graph called a crossbar: a graph H with a subset Y C V{H) of vertices is called a crossbar 
with congestion c, iff any matching over the vertices of Y can be routed with congestion at most c in 
H. They then note that if we could show an algorithm that finds a crossbar (H, Y) in graph G, with 
\Y\ = A;/ poly log A;, and constant congestion, then we can obtain a polylogarithmic approximation 
to EDPwC with constant congestion. An algorithm for constructing such a crossbar with a constant 
congestion follows from the recent work of |Chul2j . 

We follow this approach, and define a structure that we call a good crossbar, which gives slightly 
stronger properties than the general crossbar defined above. For any subset S" C y of vertices, let 
out(S') denote the set of edges with one endpoint in S and one endpoint in 1/ \ S. Informally, we 
say that a vertex set S is a-well-linked for a subset F C out(S') of edges, iff the graph G[S] is 
a-well-linked for the set F of terminals. Formally, we require that for any partition (A, B) of S, 
\E{A,B)\ > a-min{|out(>l)nF|,|out(5)nF|}. 

A good crossbar consists of three parts. The first part is a good family of vertex subsets J- = 
{Si, . . . , S^}, where 7 = 0(log^ n). The sets Si, . . . , S^ are all vertex-disjoint, and only contain non- 
terminal vertices. Each set Sj is associated with a subset F* C ont{Sj) of k* = k/ poly log edges, 
such that Sj is 1-well-linked for F*. The second part of the good crossbar is a collection Ai* C of 
k* /2 demand pairs, and the third part is a collection r = Ti, . . . ,Tfc* of k* trees. Let T* denote the 
set of all terminals participating in the demand pairs in Ai* . Then for each 1 < i < k* , the tree 
contains a distinct terminal ti €z T* , and a distinct edge Cj £ T*-, for all 1 < ? < 7- In other words, 
for each set Sj G we can view F* = {eij, . . . ,ek*j}, where e^j G E{Ti) for all 1 < i < A;*. (See 
Figure [1]) 

Given a good crossbar {T , M* ,t), we say that the congestion of this crossbar is c iff every edge of 
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Figure 1: A good crossbar 

G appears in at most c trees in r, and every edge of G with both endpoints in the same set Sj € T 
appears in at most c — 1 such trees. Chuzhoy |Chul2] has imphcitly defined a good crossbar, and has 
shown that, given a good crossbar that causes congestion c, there is an efficient randomized algorithm 
to route Q{k* / poly log A;) demand pairs with congestion at most c in graph G. This algorithm uses 
the cut-matching game of Khandekar, Rao and Vazirani jKRVOGj to embed an expander into G, and 
then finds a routing in this expander using the algorithm of Rao and Zhou |RZ10| . She has also 
shown an efficient algorithm for constructing a good crossbar with congestion 14, thus obtaining an 
0(poly log A;)-approximation with congestion 14 for EDPwC. 

We follow a similar approach here, except that we construct a good crossbar with congestion 2. 
Combining this with the result of |Chul2j . we obtain an 0(poly log A;)-approximation to EDPwC with 
congestion 2. We now provide more details about the construction of the good crossbar of ^Chul2j . and 
the barriers that need to be overcome to reduce the congestion to 2. The algorithm of |Chul2] consists 
of three steps. In the first step, a good family F = {Si, . . . , 5-^} of vertex subsets is constructed, and 
for each 1 < j < 7, a subset T*- C out(5j) is selected. Additionally, for each set Sj G F, there is a set 
Vj of k* edge-disjoint paths in graph G, connecting every edge in T* to a distinct terminal in T- In 
the second step, we construct a family r' = {T[, . . . of trees, where every tree Tj E r' contains 

a distinct terminals ti G T, and a distinct edge e^j G T* for each 1 < j < 7. However, the terminals 
in set T' = {ti I Ti G t'} do not necessarily form source-sink pairs in A^. In order to construct the 
trees in r', consider the graph G' obtained from G by contracting every set Sj G F into a super- node 
Vj. Then the problem of finding the set r' of trees is similar to the problem of packing Steiner trees 
in G' . Moreover, the collections {Vj}'^^^ of paths can be interpreted as a fractional solution of value 
poly log /c) to this problem. We can then round this solution to obtain the family r' of trees, 
using known algorithms for packing Steiner trees. The fact that we need to deal with the clusters 5*^ 
instead of the super-nodes Vj makes the problem more challenging technically, but due to the fact that 
the sets Sj are well-linked for the sets T* of edges, the problem is still tractable. The well-linkedness 
of the sets Sj G F is exploited in this algorithm, by routing several matchings between the edges of 
r* across each such set Sj, in order to simulate the super- node Vj. Finally, in the third step, we select 
a subset Ai* C 7W of k* /2 demand pairs, and connect all terminals participating in the pairs in Ai* 
to the terminals in set T' ■ The union of these new paths with the trees in r' gives the final collection 
r of trees. 

There are several factors contributing to the accumulation of congestion in this construction. We 
mention the main two barriers to reducing the congestion to 2 here. The first problem is that, given 
the good family F of vertex subsets, it is possible that for some set Sj G F the paths in Vj may 
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contain edges lying in other clusters Sj' G J-'. Moreover, these paths do not necessarily enter and leave 
the set Sji through the edges in T^* . The construction of the trees in r' uses each set Sji € to route 
several matchings between the edges of T*,. If the edges of G[Sj'] additionally participate in the paths 
in {"PjjJ^^, then this leads to accumulation of congestion. Indeed, an edge of graph G may appear in 
up to 8 trees of r' in the construction of |Chul2j . Ideally, we would like to construct a good family 
of vertex subsets, where for each set Sj G J-, the paths in Vj do not contain any vertices lying in other 
subsets Sji S J-. However, it is not clear whether it is possible to achieve such strong properties. 

The second difficulty is that step 2 and step 3 are executed separately, each contributing to the total 
congestion. It appears that one has to incur a congestion of at least 2 when constructing the set r' 
of trees, using current techniques. If the terminals in set T' do not form demand pairs in Ai, then 
we need to additionally select a subset A^* C of the demand pairs, and to route the terminals 
participating in Ai* to the terminals of T' , thus increasing the congestion beyond 2. 

In order to find a good family of vertex subsets, the algorithm of |Chul2| performs a number of 
iterations. In each iteration we start with what is called a legal contracted graph G' . This graph 
is associated with a collection C of disjoint subsets of non-terminal vertices of G, such that each set 
C G C is well-linked for out(C), and G' is obtained from G by contracting every cluster G G C into 
a super-node. Additionally, we require that for each cluster G G C, \ out(C)| is small. We call such 
a clustering C a good clustering. At the beginning of the algorithm, C = and G' = G. In every 
iteration, given a legal contracted graph G' , the algorithm either computes a good family of vertex 
subsets, or produces a new legal contracted graph G", containing strictly fewer vertices than G' . This 
guarantees that after n iterations, the algorithm produces a good family of vertex subsets. 

In order to overcome the two problems mentioned above and avoid accumulating the congestion, we 
combine all three steps of the algorithm together. We define a potential function if over collections 
C of disjoint non-terminal vertex subsets, where v'(C) roughly measures the number of edges in the 
graph obtained from G by contracting every cluster in C into a super-node. The potential function 
has additional useful properties, that allow us to perform a number of standard operations (such as 
the well-linked decomposition) on the clusters of C, without increasing the potential value. 

Our algorithm also consists of a number of iterations (that we call phases). In each such phase, we 
start with some legal contracted graph G' and a corresponding good clustering C' . We then either 
construct a good crossbar, or produce a new good clustering C" with '^{C") < 9?(C'), together with the 
corresponding new legal contracted graph G" . Each phase is executed as follows. We start with some 
clustering C* of the vertices of G, where ^{C*) < ^(C), but C* is not necessarily a good clustering. We 
then perform a number of iterations. In each iteration, we select a family = {Si, . . . , S^} of disjoint 
subsets of non-terminal vertices, that we treat as a potential good family vertex subsets, and we try to 
find a subset Ai* (1 Jii of k* demand pairs and a family r of trees, to complete the construction of a 
good crossbar. If we do not succeed in constructing a good crossbar in the current iteration, then we 
use the family of vertex subsets to refine the current clustering C* , such that the potential of the new 
clustering goes down by a significant amount. This ensures that after polynomially-many iterations, 
we will succeed in either constructing a good crossbar, or a good clustering C* with '^{C*) < ^p{C'). 

This combination of all three steps of the algorithm of |Chul2j appears necessary to overcome the two 
barriers described above. For example, it is possible that the family of vertex subsets is a good 
family, but we are still unable to extend it to a good crossbar (for example because of the problem of 
the paths in Vj using the edges of G[Sj/], as described above). Still, we will be able to make progress 
in such cases by refining the current clustering C* . Similarly, we construct the trees and connect 
the terminals participating in pairs in A4* to them simultaneously, to avoid accumulating congestion. 
Again, whenever we are unable to do so, we will be able to refine the current clustering C* . 

Organization. We start with preliminaries in Section [2] and provide an overview of our algorithm in 
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Section[3j We develop machinery to analyze vertex clusterings in Section|4| and complete the algorithm 
description in Sections [5] and [6j For convenience, a list of parameters is provided in Section |Ej and 
the known lower bounds on the integrality gap of the multi-commodity flow relaxation for EDP are 
provided in Section [D] of the Appendix. 

2 Preliminaries 

We assume that we are given an undirected n-vertex graph G = (F, E), and a set 7W = {(si, ti), . . . , (sfcj 
of k source-sink pairs, that we also call demand pairs. We denote by T the set of vertices that partic- 
ipate in pairs in A^, and we call them terminals. Let OPT denote the maximum number of demand 
pairs that can be simultaneously routed via edge-disjoint paths. Given any subset M' C 7W of the 
demand pairs, we denote by T{M') ^ T the subset of terminals participating in the pairs in M' . 

Using a standard transformation, we can assume w.l.o.g. that every terminal t & T participates in 
exactly one source-sink pair: otherwise, for each demand pair in which t participates, we can add a 
new terminal to the graph, that will replace t in the demand pair, and connect this new terminal to t. 
Similarly, using standard transformations, we can assume that the degree of every terminal is exactly 
1, and the degree of every non-terminal vertex is at most 4. In order to achieve the latter property, 
we replace every vertex v whose degree > 4 with a x grid, and connect the edges incident 
on V to the vertices of the first row of the grid. It is easy to verify that these transformations do not 
affect the solution value. From now on we assume that the degree of every terminal is 1, the degree of 
every non-terminal vertex is at most 4, and every terminal participates in at most one demand pair. 

For any subset 5 C y of vertices, we denote by outG'(5') = Eg{S, V \ S), and by Eg{S) the subset of 
edges with both endpoints in S, omitting the subscript G when clear from context. Throughout the 
paper, we say that a random event succeeds w.h.p., if the probability of its success is (1 — 1/ poly(n)). 
All logarithms are to the base of 2. 

Let V be any collection of paths in graph G. We say that paths in V cause congestion r] in G, iff for 
every edge e G E, at most rj paths in V contain e. Given any pair {Ei,E2) of subsets of edges, we 
denote hy F : Ei E2 the flow where every edge in Ei sends one flow unit to the edges in E2, and 
the congestion due to the flow F is at most ry. If additionally every edge in E2 receives at most one 
flow unit, then we denote this flow hy F : Ei £'2. Similarly, given a set V of paths connecting the 
edges of Ei to the edges of -E2, we denote V : Ei £"2 iff P = {Pe \ e G Ei}, where e is the first 
edge on Pe, and the total congestion caused by the paths in V is at most rj. If every edge of E2 has at 
most one path terminating at it, then we denote V : Ei E2. We use a similar notation for flows 
and path sets connecting subsets of vertices to each other, or a subset vertices with a subset of edges. 

Given a subset S of vertices and two subsets Ei, E2 ^ out(5') of edges, we say that the flow F : Ei -w,^ 
E2 is contained in S iff every flow-path is completely contained in G[S], except for its first and last 
edges, that belong to out(S'). Similarly, we say that a set V : Ei -^^^ E2 of paths is contained in S iff 
all inner edges on every path in V belong to G[S]. 

Sparsest Cut and the Flow-Cut Gap. Suppose we are given a graph G = {V,E), and a subset 
T '^V oik terminals. The sparsity of a cut (S*, S) in G is ^{S) = . j, ^^f:"^^^ , and the value of the 

mill"! |ori / |,|i!?ri / I r 

sparsest cut in G is defined to be: ^{G) = min^^y {'^'('S')}- The goal of the sparsest cut problem is, 
given an input graph G and a set T of terminals, to find a cut of minimum sparsity. Arora, Rao and 
Vazirani (ARV 09] have shown an 0(-v/Iog~fc)-approximation algorithm for the sparsest cut problem. 
We denote this algorithm by .4arV) and its approximation factor by 0!arv(^) = 0{-\/\og k). 

A problem dual to sparsest cut is the maximum concurrent fiow problem. For the above defini- 
tion of the sparsest cut problem, the corresponding variation of the concurrent flow problem asks 
to flnd the maximum value A, such that every pair of terminals can send \/k flow units to each 
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other simultaneously with no congestion. The flow-cut gap is the maximum ratio, in any graph, 
between the value of the minimum sparsest cut and the maximum value A of concurrent flow. The 
value of the flow-cut gap in undirected graphs, that we denote by /3{k) throughout the paper, is 
G(logfc) |LR99l IGVY951 ILLR941 IAK98| . Therefore, if ^>(G) = a, then every pair of terminals can 
send -^^j^ flow units to each other with no congestion. Equivalently, every pair of terminals can send 
1/k flow units to each other with congestion at most f3(k)/a. Moreover, any matching on the set T 
of terminals can be fractionally routed with congestion at most 2/3{k)/a. 

Well-Linkedness. The notion of well-linkedness has been widely used in graph decomposition and 
routing, see e.g. (CKSOSt IRZlOl I And 1 0] . While the main idea is similar, the deflnition details differ 
from paper to paper. Our definition of well-linkedness is similar to that of |Chul2| . 

Definition 1. Let S be any subset of vertices of a graph G. For any integer ki, for any < a < 1, 
we say that set S is (fci, a)-well-linked iff for any pair Ti,T2 C out(S) of disjoint subsets of edges, 
with \Ti\ + \T2\ <ki,foT any partition {X, Y) of S with Ti C out(X) and T2 C out(y), lEciX, Y)\ > 
a ■ min{|ri|, |r2|}. 

Suppose a set S is not (fci, a)-well-linked. We say that a partition {X,Y) of 5 is a {ki, a) -violating 
partition, iff there are two subsets Ti C out(X)nout(5), T2 C out(y)nout(5) of edges with |ri|-|-|T2| < 
ki, and |^G(^,m <a-min{|ri|,|r2|}. 

Definition 2. Given a graph G, a subset S of its vertices, a parameter a > 0, and a subset 
r C out(S') of edges, we say that S is a- well-linked for F, iff for any partition {A, B) of S, \E[A, B)\ > 
a -minjlF n out(A)|, |F n out(i?)|}. We say that the set S is a- well- linked iff it is a- well- linked for the 
set out(S') of edges. 

Notice that if | out(S')| < ki, then S is a-well-linked iff it is a)-well-linked. 

Similarly, if we are given a graph G and a subset T of its vertices called terminals, we say that G 
is CK-well linked for T, iff for any partition {A,B) of V{G), \E{A,B)\ > a ■ min{|A n T], |5 n r|}. 
Notice that if G is a-well-linked for T, then for any pair (71,72) of subsets of T with |7i| = |72|, we 
can efficiently find a collection V of paths, V : 7i -^^i/a] ^2- This follows from the min-cut max-fiow 
theorem and the integrality of flow. 

Given a graph G, a subset S C V{G) of vertices, and a subset F C out(S') of edges, we deflne an 
instance SC(G, S, F) of the sparsest cut problem as follows. First, we sub-divide every edge e G F by 
a new vertex t^, and we let T(F) = {te \ e £ F}. Let Gs(r) be the sub-graph of the resulting graph, 
induced by 5* U T(F). The instance SC(G, S, F) of the sparsest cut problem is deflned over the graph 
Gs(r), where the vertices of 7'(F) serve as terminals. Observe that for all a < 1, the value of the 
sparsest cut in SC(G, S, F) is at least a iff set S is a-well-linked with respect to F in graph G. If 
F = outG(5'), then we will denote the corresponding instance of the sparsest cut problem by SC(G, S). 

The Grouping Technique. The grouping technique was first introduced by Chekuri, Khanna and 
Shepherd |CKS04bj , and has since been widely used in algorithms for network routing |CKS05t IRZIO^ 
lAndlOl IChul2j . to boost the network connectivity and well-linkedness parameters. We start with a 
simple standard grouping procedure, summarized in the following theorem. A proof of the following 
theorem can be found e.g. in jCKS04b| . 

Theorem 2 Suppose we are given a connected graph G = {V,E), with weights w{v) on vertices 
V £ V , and a parameter p, such that for each v £ V , < w{v) < p, and Ylvev '^iv) > p. Then we can 
efficiently find a partition Q ofV, and for each group U £ Q, find a tree Tjj C G containing all vertices 
ofU, such that the trees {Tu}jj^g are edge-disjoint, and for each U £G, p< J2veU ''^i'") — 



7 



We also use a more sophisticated grouping technique, due to Chekuri, Khanna and Shepherd |CKS04b] . 
The next two theorems summarize the specific settings in which the technique is used. Since these 
settings are shghtly different from the setting of jCKS04bj . we provide their proofs for completeness 
in Section [a] in the Appendix. The proofs closely follow the arguments of |CKS04b] . 

Theorem 3 Suppose we are given a connected graph G = [V, E) with maximum vertex degree at 
most 4, and r subsets Ti, ■ ■ ■ ,% of vertices called terminals (we do not require that they are disjoint). 
Assume further that G is a-well-linked for the set ljj=i 75 terminals, for some a < 1, |71| = ki, and 
that for each 1 < j < r , \Tj\ = k2, where ki,k2 > r2(r^ logr/a). Then there is an efficient randomized 
algorithm that w.h.p. computes, for each 1 < j < r, a subset Tj Tj of terminals, such that all 

sets l^^'l f^^^ mutually disjoint, \Tj \ = Q{a/r'^) ■ \Tj\ for all I < j < r, and G is 1-well-linked for 

Theorem 4 Suppose we are given a connected graph G = {V, E) with maximum vertex degree at most 
4, two subsets 71,72 of vertices called terminals, and a perfect matching M over the terminals o/71. 
Assume further that |7i| = ki, |72| = k2, where ki,k2 > 100/a, and G is a-well-linked for the set 
71 U 72 o/ terminals, for some a < 1. Then there is an efficient algorithm, that either computes a 
routing of a subset Ai' C Ji4 of 0,{aki) pairs on edge-disjoint paths, or returns two disjoint subsets 
T{ ^ Ti, T2 ^ 72, so that \T{\ = 0,{aki), |7^'| = Q{ak2), G is 1-well-linked for T{ U 7^', and for every 
pair {s,t) £ Ai, either both s and t belong to T{, or neither of them does. 

Expanders and the Cut-Matching Game. We say that a (multi)-graph G = {V, E) is an a- 

expander, iff min s<zv \ ^'^lo'i'^^^ [ > a. 

\S\<\V\/2 ^ ' ' J 

We use the cut-matching game of Khandekar, Rao and Vazirani |KRV06| . In this game, we are given 
a set V oi N vertices, where A'" is even, and two players: a cut player, whose goal is to construct an 
expander X on the set V of vertices, and a matching player, whose goal is to delay its construction. 
The game is played in iterations. We start with the graph X containing the set V of vertices, and no 
edges. In each iteration j, the cut player computes a bi-partition {Aj,Bj) of V into two equal-sized 
sets, and the matching player returns some perfect matching Mj between the two sets. The edges of 
Mj are then added to X. Khandekar, Rao and Vazirani have shown that there is a strategy for the 
cut player, guaranteeing that after 0(log^ A^) iterations we obtain a ^-expander w.h.p. Subsequently, 
Orecchia et al. |OSVV08] have shown the following improved bound: 

Theorem 5 ( |OSVV08| ) There is a probabilistic algorithm for the cut player, such that, no matter 
how the matching player plays, after 7cmg(-^) = O(log^A^) iterations, graph X is an acMG(A^) = 
Vt{\ogN)- expander, with constant probability. 

3 Algorithm Overview 

Throughout the paper, we denote by 7cmg = Icucik) = 0(log^ k) the parameter from Theorem [5] The 
central combinatorial object in our algorithm is what we call a good crossbar, that we define below. 

Definition 3. Given a graph G = {V,E), a set M oi k source-sink pairs, and a parameter k* = 
kj poly log A;, a good crossbar consists of the following three components: 

1. A family 5* = \^S\, . . . , 'S'^cmgI disjoint subsets of non-terminal vertices. Each set S* E 5* 
is associated with a subset T* C out(S'*) of 2/c* edges, and S* is 1-well-linked for F*. 
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2. A subset M* Q Ai oi k* demand pairs. Let T* = T{M*) be the corresponding set of terminals. 

3. A collection r* = {Ti, . . . ,T2k*} of trees in graph G. Each tree Tj E r*, contains a distinct 
terminal ti £ T* , and for each 1 < j < 7cmg) tree T, contains a distinct edge e^j- £ T*. In other 
words, T* = {ti, . . . ,t2k*}, where U G Tj for each 1 < i < 2A;*; and for each 1 < j < 7cmg) 
r* = {eij, . . . , e2k*,j}, where e^j S Tj for each 1 < i < 2k* . 

We say that the congestion of the good crossbar is c iff every edge of G participates in at most c trees 
in T* , while every edge in |Jj£'i'^ ^i'^j) belongs to at most c — 1 such trees. 

Our main result is that if G is 1-well-linked for the set T of terminals, then there is an efficient 
randomized algorithm, that either finds a good congestion-2 crossbar, or routes a subset oik/ poly log k 
demand pairs with congestion at most 2 in G. We summarize this result in the following theorem. 

Theorem 6 Assume that we are given an undirected graph G = {V, E) with vertex degrees at most 
4, and a set M of k demand pairs, defined over a set T of terminals. Assume further that the degree 
of every terminal is 1, every terminal participates in exactly one pair in M, and G is 1-well-linked 
for T. Then there is an efficient randomized algorithm, that with high probability outputs one of the 
following: (i) Either a subset M' C of k/ poly log A; demand pairs and the routing of the pairs in 
Ai' with congestion at most 2 in G; or (ii) a good congestion-2 crossbar {S* ,t*). 

The proof of Theorem [l] follows from the proof of Theorem |6] using techniques from previous work, 
and it appears in Section |B] of the Appendix. We note that the algorithm of |Chul2| also proceeded by 
(implicitly) constructing a good congestion-14 crossbar. Our main challenge is to reduce the congestion 
of the good crossbar to 2. From now on we focus on proving Theorem [6j 

As in |Chul2| . throughout the algorithm, we maintain a partition C of the vertices of G into clusters, 
and a legal contracted graph of G, obtained by contracting each cluster C £ C into a super- node vc- 
The algorithm performs a number of phases, where in every phase we either find the desired routing of 
a subset of demand pairs in graph G, or compute a good congestion-2 crossbar, or find a new partition 
C' of the vertices of G into clusters, whose corresponding contracted graph is strictly smaller than the 
current contracted graph. In the next section we define several types of clusterings the algorithm uses, 
the notion of the legal contracted graphs, and several operations on a given clustering, that are used 
throughout the algorithm. 

4 Vertex Clusterings and Legal Contracted Graphs 

Our algorithm uses the following parameters. Let 7cmg = 7cmg(^) = 0(log^A;) be the parameter 
for the number of iterations in the cut-matching game from Theorem [sj We let 7 = 224^4^,^^ _ We 

will also use the following two parameters for well-linkedness: a = ^n^^^^^^ = O ( \og& k ) ' ^^^^ to 
perform the well-linked decomposition, and Owl = aAHv-(fc) ~ ^ ( log'^-^ fc ) " well-linkedness factor 
we achieve. Finally, we use a parameter ki = ^92-^3 log^ = poiylog k ' assume that the parameter 

k is large enough, so fci > 4/a (otherwise we can simply route one of the demand pairs to obtain an 
0(poly log A;)-approximation). We say that a cluster C C V{G) is large iff | out(C)| > ki, and we say 
that it is small otherwise. 

Definition 4. Given a partition C of the vertices of V{G) into clusters, we say that C is an acceptable 
clustering of G iff: 

• Every terminal t € T is in a separate cluster, that is, {t} G C; 
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• Each small cluster C G C is awL-well-linked; and 

• Each large cluster C S C is a connected component. An acceptable clustering that contains no 
large clusters is called a good clustering. 

Definition 5. Given a good clustering C of G, a graph He is a legal contracted graph of G associated 
with C, iff we can obtain Hq from G by contracting every C £ C into a super-node vc- We remove 
all self- loops, but we do not remove parallel edges. (Note that the terminals are not contracted since 
each terminal has its own cluster). 

The following claim was proved in |Chul2j . We include its proof here for completeness, since we have 
changed some parameters. 

Claim 1 If G' is a legal contracted graph for G, then G' \T contains at least k/3 edges. 

Proof: For each terminal t £ T, let et be the unique edge adjacent to t in G' , and let ut be the 
other endpoint of et- We partition the terminals in T into groups, where two terminals t,t' belong 
to the same group iff ut = Ufi. Let G be the resulting partition of the terminals. Since the degree of 
every vertex in G' is at most ki, each group U £ Q contains at most ki terminals. Next, we partition 
the terminals in T into two subsets X,Y, where \X\, \ Y\ > k/3, and for each group U £ Q, either 
U C X, or [/ C y holds. We can find such a partition by greedily processing each group U £ Q, 
and adding all terminals of U to one of the subsets X or Y, that currently contains fewer terminals. 
Finally, we remove terminals from set X until \X\ = k/3, and we do the same for Y. Since graph G' 
is 1-well-linked for the terminals, it is possible to route k/3 flow units from the terminals in X to the 
terminals in Y, with congestion at most 1. Since no group U is split between the two sets X and Y, 
each flow-path must contain at least one edge of G' \ T. Therefore, the number of edges in G' \T is 
at least k/3. □ 

Given any partition C of the vertices of G, we define a potential (p{C) for this clustering. The idea is 
that 93(C) will serve as a tight bound on the number of edges connecting different clusters in C. At 
the same time, the potential function is designed in such a way, that we can perform a number of 
operations on the current clustering (such as, for example, well-linked decompositions of the clusters), 
without increasing the potential. 

At a high level, our algorithm maintains a good clustering C of V{G), where at the beginning, every 
vertex belongs to a separate cluster. The algorithm consists of a number of phases, where in each phase 
we start with some good clustering C, and either route k / poly log k of the demand pairs with congestion 
at most 2 in G, or find a good congestion-2 crossbar in G, or produce another good clustering C with 
(/7(C') < (p{C) — 1. After a polynomial number of phases, we will therefore either obtain a routing of 
A;/polylog(k) demand pairs with congestion at most 2, or find a good congestion-2 crossbar. 

Each phase of the algorithm is executed as follows. We start with the current good clustering C, 
and construct another acceptable clustering C with <^{C') < ^{C) — 1. We then perform a number 
of iterations, where in each iteration we either route k/ poly log A; demand pairs in G, or find a good 
congestion-2 crossbar, or revisit the current acceptable clustering C' , and perform an operation that 
produces another acceptable clustering, whose potential is strictly smaller than that of C . In the end, 
we will either successfully route k/ poly log A; demand pairs in G, or construct a good congestion-2 
crossbar in G, or we will find a good clustering C with (p{C') < ip{C) — 1. We now proceed to define 
the potential function, and the operations we perform on clusterings. 

Suppose we are given any partition C of the vertices of G. For any integer h, we define a potential 
^p{h), as follows. For h < ki, ip{h) = 4a log /i. In order to define ip{h) for h > ki, we consider 
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the sequence {no,ni, . . .} of numbers, where = The potentials for these numbers are 

<p{no) = <p{ki) = 4Q;IogA;i + 4a, and for i > 0, (p{ni) = 4^ + ip{ni-{). Notice that for all i, 
< 12a + 4Q;log fci < 8aIogA;i < 

We now partition all integers h > ki into sets 5*1, S2,..., where set Si contains all integers h with 
T^i-i < h < rii. For h G Si, wc define (p{h) = (p{ni-i). This finishes the definition of potentials of 
integers. Clearly, for all h, (f{h) < 

Assume now that we are given some edge e G If both endpoints of e belong to the same cluster of C, 
then we set its potential ip{e) = 0. Otherwise, if e = {u, v), and u e C with | out(C)| = h, while v e C 
with I out(C")| = h', then we set ip{e) = 1 + 'p{h) + (p{h'). We think of ip{h) as the contribution of u, 
and ip{h') the contribution of v to (p{e). Notice that (p{e) < 1.1. Finally, we set (fi{C) = J2eeE 'Pi^)- 

Suppose wc arc given any partition C of V{G). Our first step is to show that we can perform a 
well-linked decomposition of small clusters in C, without increasing the potential. 

Theorem 7 Let C be any partition of V{G), and let C E C be any small cluster, such that G[C] is 
connected. Then there is an efficient algorithm that finds a partition W of C into small clusters, such 
that each cluster R & W is a^L-well-linked, and additionally, if C is a partition obtained from C by 
removing C and adding the clusters ofW to it, then ip{C') < ip{C). 

Proof: We perform a standard well-linked decomposition of C. Wc maintain a partition W of C, 
where at the beginning, W = {C}. We then perform a number of iterations. 

In each iteration, we select a cluster S G W, and set up the following instance of the sparsest cut 
problem. First, we sub-divide every edge e G out(5) with a vertex t^, and let Ts = {te \ e G out(5)}. 
We then consider the sub- graph of the resulting graph induced hy SuTs, where the vertices of Ts serve 
as terminals. We run the algorithm ^arv on the resulting instance of the sparsest cut problem. If the 
sparsity of the cut produced by the algorithm is less than a, then we obtain a partition (X, Y) of S, 
with \E{X,Y)\ < a ■ min{| out(S') n out(X)|, | out(5') nout(y)|}. In this case, we remove S from W, 
and add X and Y instead. The algorithm ends when for every cluster S G W, .Aarv returns a partition 
of sparsity at least a. We are then guaranteed that every cluster in W is a/aAnv{k) = awL-well- linked, 
and it is easy to verify that all these clusters are small. 

It now only remains to show that the potential does not increase. Each iteration of the algorithm is 
associated with a partition of the vertices of G, obtained from C by removing C and adding all clusters 
of the current partition W of C to it. It is enough to show that if C is the current partition of V{G), 
and C" is the partition obtained after one iteration, where a set S G C was replaced by two sets X and 
Y, then ip{C") < ip{C'). 

Assume w.l.o.g. that |out(X)| < |out(y)|, so |out(X)| < 2| out(S')|/3. Let h = \out{S)\,hi = 
I out(X)|, /i2 = I out(y)|, and recall that h, h\,h2 < k\. The changes to the potential are the following: 

• The potential of the edges in out(y) n out(S') only goes down. 

• The potential of every edge in out(X)nout(S') goes down by — 99(^.1) = 4q! log /i— 4q! log hi = 
4a log ^ > 4a log 1.5 > 2.3a, since hi < 2h/2>. So the total decrease in the potential of the edges 
in out(X) n out(S') is at least 2.3a • | out(X) n out(5')|. 

• The edges in E{X, Y) did not contribute to the potential initially, and now contribute at most 
l + ip{hi) + ip{h2) < 2 each. Notice that \E{X, < a- 1 out(X)nout(S')|, and so they contribute 
at most 2a • | out(X) fl out(S')| in total. 
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Clearly, from the above discussion, the overall potential goes down. 



□ 



Assume that we are given an acceptable clustering C of G. We now define two operations on G, each 
of which produces a new acceptable clustering of G, whose potential is strictly smaller than (p{C). 

Action 1: Partitioning a large cluster. Suppose we are given an acceptable clustering C of G, a 
large cluster C gC, and a (/ci, a)-violating partition {X,Y) of C. In order to perform this operation, 
we first replace C with X and Y in C. If, additionally, any of the clusters, X or Y, become small, 
then we perform a well-linked decomposition of that cluster using Theorem [7j and update C with 
the resulting partition. Clearly, the final partitioning C is an acceptable clustering. We denote this 
operation by PARTITION(C, X, F). 

Claim 2 Let C he the outcome of operation PARTITION (C, X, y). Then ip{C') < ip{C) - 2. 

Proof: Let C" be the clustering obtained from C, by replacing C with X and Y. From Theorem [7| it 
is enough to prove that ^p{C") < 99(C) — 2. 

Assume w.l.o.g. that |out(X)| < |out(y)|. Let h = |out(C)|, hi = |out(X)|, /i2 = |out(y)|, so 
hi < 2/1/3. Assume that h G Si. Then either hi G Si' for i' < i — 1, 01 hi < ki. The changes in the 
potential can be bounded as follows: 

• The potential of every edge in out(y) n out(C) does not increase. 

• The potential of every edge in out(X) n out(C) goes down by (p{h) — (p{hi). Assuming that 
h G Si, tf{h) > + (p{hi) > + (p{hi). The total decrease in the potential of the edges in 

out(X) n out(C) is at least ^ • | out(X) n out(C)| > 2aki, since | out{X) n out(C)| > hi/2. 

• Additionally, every edge in E{X, Y) now pays 1 + ^{hi) + i/3(/i2) < 2. Since \E{X, Y)\ < aki/2, 
the total increase in this part is at most aki. 

The overall decrease in the potential is at least aki > 2. □ 

Action 2: Separating a large cluster. Let C be the current acceptable partition, and let C G C be 

a large cluster in C. Assume further that we are given a cut {A, B) in graph G, with C A, T B, 
and \Eg{A, B)\ < ki/2. We perform the following operation, that we denote by SEPARATE(C, A). 

Consider some cluster S G C. If S" is a small cluster, but 5" \ ^ is a large cluster, then we modify A by 
removing all vertices of S from it. Notice that in this case, the number of edges in E{S) that originally 
contributed to the cut {A, B), \E{S r\A,Sr\B)\ > \ out{S) n E{A)\ must hold, so | out(^)| only goes 
down as a result of this modification. We assume from now on that if 5 G C is a small cluster, then 
iS \ >1 is also a small cluster. We build a new partition C' of V{G) as follows. First, we add every 
connected component of G[A] to C. Notice that all these clusters are small, as | out(^)| < ki/2. Next, 
for every cluster S G C, such that S \ A / 0, we add every connected component of G[5 \ A] to 
C . Notice that every terminal t G T is added as a separate cluster to C So far we have defined a 
new partition C of V{G). This partition may not be acceptable, since we are not guaranteed that 
every small cluster of C is well-linked. In our final step, we perform a well-linked decomposition of 
every small cluster of C, using Theorem [7| and obtain the final acceptable partition C" of vertices of 
G. Notice that if S € C" is a large cluster, then there must be some large cluster 5' in the original 
partition C with S C S' . 

Claim 3 Let C" he the outcome of operation SEPARATE(C, A). Then (p{C") < ip{C) - 1. 
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Proof: In order to prove the claim, it is enough to prove that 9?(C') < 93(C) — 1, since, from Theorem[7| 
well-Hnked decompositions of small clusters do not increase the potential. 

We now show that ip{C') < '^{C) — 2. We can bound the changes in the potential as follows: 

• Every edge in out (A) contributes at most 1.1 to the potential of C", and there are at most 
such edges. These are the only edges whose potential in C" may be higher than their potential 
in C. 

• Every edge in out(C) contributed at least 1 to the potential of C, and there are at least ki such 
edges, since C is a large cluster. 

Therefore, the decrease in the potential is at least ki — ^-^(^i"^) >\, □ 

To summarize, given any acceptable clustering C of the vertices of G, let E' be the set of edges whose 
endpoints belong to distinct clusters of C. Then \E'\ < if{C) < 1.1\E'\. So the potential is a good 
estimate on the number of edges connecting the different clusters. We have also defined two actions on 
large clusters of C: PARTITION(C, X, y), that replaces a large cluster C with a pair X,Y of clusters, 
where {X,Y) is a (A;, a)-violating partition of C, and SEPARATE(C, ^), where A is a cut of size less 
than k/2, separating a large cluster C from the terminals. Each such action returns a new acceptable 
clustering, whose potential goes down by at least 1. 

5 The Algorithm 

In this section we prove Theorem [6j by providing an efficient randomized algorithm, that w.h.p. either 
computes a subset M' Q M oi k/ poly log A: demand pairs and their routing with congestion at most 
2 in G, or finds a good congestion-2 crossbar in G. 

We maintain, throughout the algorithm, a good clustering C of G. Initially, C is a partition of V{G), 
where every vertex of G belongs to a distinct cluster, that is, C = {{v} \ v G V{G)}. Clearly, this 
is a good clustering. We then perform a number of phases. In every phase, we start with some 
good clustering C and the corresponding legal contracted graph Hq. The phase output is one of the 
following: either (1) a subset M' C 7W of A:/ poly log k pairs, with the routing of the pairs in M' with 
congestion at most 2 in G, or (2) a good congestion-2 crossbar [S* ,M* , r*) in G; or (3) another good 
clustering C, such that </?(C') < ip{C) — 1. In the first two cases, we terminate the algorithm, and 
output either the routing of the pairs in M' , or the good crossbar. In the latter case, we continue 
to the next phase. After Odi?!) phases, our algorithm will successfully terminate with the required 
output. It is therefore enough to prove the following theorem. 

Theorem 8 Let C be any good clustering of the vertices of G, and let Hq be the corresponding legal 
contracted graph. Then there is an efficient randomized algorithm that w.h.p. computes one of the 
following: 

• Either a subset M! C 7W ofkj poly log A; pairs and a routing of the pairs in M! in graph G with 
congestion at most 2; 

• or a good congestion-2 crossbar {S* ,M* ,t*) in G; 

• or a new good clustering C , such that ip{C') < ip{C) — 1. 

The rest of this section is dedicated to proving Theorem |8j 
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We assume that we are given a good clustering C of the vertices of G, and the corresponding legal 
contracted graph G' = Hq- 

Let m be the number of edges in G' \ T- From Claim [T| m > k/3. As a first step, we randomly 
partition the vertices in G' \ T into 7 subsets Xi, . . . , X^, where each vertex v E V{G') \ T selects an 
index 1 < j < 7 independently uniformly at random, and is then added to Xj. We need the following 
claim, that appeared in |Chul2j . The proof appears in Section [C] of the Appendix for completeness. 

Claim 4 With probability at least \, for each 1 < i < 7, | o\xiG'{Xj)\ < while \EG'{Xj)\ > 

Given a partition Xi, . . . , X^, we can efficiently check whether the conditions of Claim |4] hold. If they 
do not hold, we repeat the randomized partitioning procedure. Prom Claim |4j we are guaranteed that 
w.h.p., after poly(n) iterations, we will obtain a partition with the desired properties. Assume now 
that we are given the partition Xi, . . . , X^ of V{G') \ T, for which the conditions of Claim |4] hold. 

Then for each 1 < j < 7, |^G'(^i)l > '°"^207^'^' - Let X'j C V{G) \ T be the set obtained from Xj, 
after we un-contract each cluster, that is, for each super-node vc G Xj, we replace vc with the vertices 

of C. Notice that [x'^Y ^ is a partition of V{G) \ T. 

The plan for the rest of the proof is as follows. For each 1 < j < 7, we will maintain an acceptable 
clustering Cj of the vertices of G. That is, for each 1 < i < 7, Cj is a partition of V{G). In addition 
to being an acceptable clustering, it will have the following property: 

PI. If C G Cj is a large cluster, then G Q Xj. 

The initial partition Cj, for 1 < j < 7 is obtained as follows. Recall that C is the current good 
clustering of the vertices of G, and every cluster C G C is either contained in X'j, or it is disjoint 
from it. First, we add to Cj all clusters G £ C with C n = 0. Next, we add to Cj all connected 
components of G[Xj]. If any of these components is a small cluster, then we perform a well-linked 
decomposition of this cluster, using Theorem [7| and update Cj accordingly. Let Cj be the resulting 



final partition. Clearly, it is an acceptable clustering, with property (PI). Moreover, we show that 
^{Cj) < ^{C) - 1: 

Claim 5 For each 1 < j < 7, f{Cj) < (p{C) — 1. 

Proof: Let Cj be the partition of V{G), obtained as follows: we add to Cj all clusters G £ C with 
G n Xj = 0, and we add all connected components of G[Xj] to Cj (that is, Cj is obtained like Cj, 
except that we do not perform well-linked decompositions of the small clusters). From Theorem [7| it 
is enough to prove that ^{C'j) < 95(C) — 1. The changes of the potential from C to Cj can be bounded 
as follows: 

• The edges in EQi{Xj) contribute at least 1 to 93(C) and contribute to ^{C'j). 

• The potential of edges in outG(Xj) may increase. The increase is at most (/^(n) < 1^ per edge. 

So the total increase is at most ^ "^^2^^^^^ — ^^^^1^^. These are the only edges whose potential 
may increase. 

Overall, the decrease in the potential is at least > ^ > _k_^ > ^ 

If any of the partitions Ci, . . . ,C^ is a good partition, then we have found a good partition C' with 
(/7(C') < f(C) — 1. We terminate the algorithm and return C. Otherwise, we select an arbitrary large 
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cluster Sj G Cj. We then consider the resulting collection Si,...,S-./ of large clusters, and try to 
exploit them to construct a good crossbar. Since for each 1 < j < 7, Sj C X'j, the sets 5i, . . . , 5^ are 
mutually disjoint and they do not contain terminals. Our algorithm performs a number of iterations, 
using the following theorem. 

Theorem 9 Suppose we are given, for each 1 < j < 7, an acceptable partition Cj of V{G) that has 
the property {PI), and contains at least one large cluster Sj. Then there is an efficient randomized 



algorithm, that w.h.p. computes one of the following: 



• Either a subset Ai' ^ M of /c/ poly log A; demand pairs, and a routing of pairs in Ai' with 
congestion at most 2 in G; 

• Or a good congestion-2 crossbar {S* ,A4* ,t*); 

• Or a (ki, a) -violating partition (X,Y) of Sj, for some 1 < i < 7; 

• Or a cut {A, B) in G with Sj <^ A, T <^ B and {EdA, B)\ < ki/2, for some 1 < i < 7. 



We provide the proof of Theorem [9] in the following section, and complete the proof of Theorem [8] 
here. Suppose we are given a good partition C of the vertices of G. For each 1 < i < 7, we compute an 
acceptable partition Cj of V{G) as described above. If any of the partitions Cj is a good partition, then 
we terminate the algorithm and return Cj. From the above discussion, ip{Cj) < (p{C) — 1. Otherwise, 
for each 1 < i < 7, we select any large cluster Sj £ Cj, and apply Theorem [9] to the current family 
{^j}]=i of acceptable clusterings. If the outcome of Theorem j9] is a subset A4' of demand pairs with a 
routing of these pairs in G, then we terminate the algorithm and return this routing. If the outcome is 
a good congestion-2 crossbar, then we terminate the algorithm and return this good crossbar. We say 
that the iteration is successful if one of these two cases happen. Otherwise, we apply the appropriate 
action: PARTITI0N(5j, X, 1"), or SEPARATE(S'j, ^) to the clustering Cj. As a result, we obtain an 
acceptable clustering Cj, with v?(Cj) < ^(Cj) — 1. Moreover, it is easy to see that this clustering also 



has Property (PI ): if the PARTITION operation is performed, then we only partition existing clusters; 
if the SEPARATE operation is performed, then the only large clusters in the new partition Cj are 
subsets of large clusters in Cj . 

If all clusters in Cj are small, then we can again terminate the algorithm with a good partition Cj, with 
fi^'j) ^ — 1- Otherwise, we select any large cluster Sj E Cj, and continue to the next iteration. 
Overall, as long as we do not complete a successful iteration, and we do not find a good clustering C 
of V{G) with ip(C') < ip{C) — 1, we make progress in each iteration by decreasing the potential of one 
of the partitions Cj by at least 1, by performing either a SEPARATE or a PARTITION operation on 
one of the large clusters of Cj . After polynomially many iterations we are then guaranteed to complete 
a successful iteration, or find a good clustering C with (p{C') < (p{C) — 1, and finish the algorithm. 
Therefore, in order to complete the proof of Theorem [8] it is now enough to prove Theorem |9j 

6 Proof of Theorem [o] 

Let TZ = {Si, . . . , S.y}. 

Throughout the algorithm, we will sometimes be interested in routing flow across the sets Sj £ TZ. 
Specifically, given two subsets r,r' C out(5j) of edges, with |r| = |r'| < ki/2, we will be interested 
in routing the edges of P to the edges of P' inside Sj, with congestion at most 1/a. In other words, 
we will be looking for a flow F : P ^i/q, P'. Notice that if such a flow does not exist, then we can 
find a (A;i, a)-violating partition {X,Y) of Sj, by using the min-cut max- flow theorem. We can then 
return this partition and terminate the algorithm. Therefore, in order to simplify the exposition of 
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the algorithm, we will assume that whenever the algorithm attempts to find such a flow, it always 
succeeds. Prom the integrality of flow, we can also find a set 7^ : F ~-^|"i/a] of paths contained in Sj. 

We start by verifying that for each 1 < j < 7, the vertices of Sj can send ki/2 flow units with no 
congestion to the terminals. If this is not the case for some set Sj, then there is a cut {A,B) with 
Sj C A, T C B and \Eg{A,B)\ < ki/2. We then return the partition {A,B) of G and finish the 
algorithm. Prom now on we assume that each set Sj can send A;i/2 flow units with no congestion to 
the terminals. 

Let G' be the graph obtained from G by replacing every edge of G by two bi-directed edges. The 

rest of the proof consists of three steps. In the first step, we construct a degree-3 tree T, whose 
vertex set V{T) = {ys \ S G 7^'}, for a large enough family TZ' C TZ oi vertex subsets, and each edge 
e = {vs,vs') in tree T corresponds to a collection Ve of paths in graph G', connecting the vertices of 
S to the vertices of S' or vice versa. Moreover, we will ensure that the paths in UeesCf ) cause 
congestion 1 in G' . In the second step, we find a subset Ai* C 7W of the demand pairs, and route the 
terminals in T{A4*) to the vertices of 5 U S' , where {S, S') is some pair of vertex subsets in TZ' . In 
the final third step, we construct a good congestion-2 crossbar. 

6.1 Step 1: Constructing the Tree 

This step is summarized in the following theorem. 

Theorem 10 There is an efficient algorithm, that either computes a {ki, a) -violating partition of 
some set S eTZ, or finds a subset TZ' C.TZ of size r = 87cmg; 0, tree T of maximum degree 3 and vertex 
set V(T) = {vs \ S e TZ'}, and a collection Ve of k2 = ^ (^ ^^"3"^^ ^ paths in graph G' for every edge 
e G E{f), such that: 

• For each edge e = {vs,vs') € E[T), every path P £ Ve connects a vertex of S to a vertex of S' , 
or a vertex of S' to a vertex of S, and it does not contain the vertices of \Jg„^.j^, S" as inner 
vertices. 

• The set UeeE(T) of paths causes congestion at most 1 in G' . 

Proof: Since the set T of terminals is 1-well- linked, every pair {Sj,Sji) of vertex subsets can send 
ki/2 flow units to each other with congestion at most 3 (concatenate the flows from Sj to a subset 
71 of the terminals, from 5*^/ to a subset 72 of the terminals, and the flow between the two subsets 
of the terminals). Equivalently, for each pair {Sj,Sj'), there are at least [A;i/6j edge-disjoint paths 
connecting Sj to Sj in G. We can assume that these paths do not contain any terminals, as the degree 
of every terminal in G is 1. We say that a path P is direct iff it does not contain the vertices of 
TU ([Jj=i Sj^ as its inner vertices. 

We build the following graph Z. Let k' = [^J = poiy*iog k ■ "^^^ vertices of Z are {vi, . . . , v^}, where 
vertex Vj represents the set Sj. There is an edge {vj,Vji) for j ^ j' in Z iff there are at least k' direct 
edge-disjoint paths connecting Sj to Sji in G. (Notice that this can be checked efficiently). 

It is easy to verify that graph Z is connected: indeed, assume otherwise. Let A be any connected 
component of Z, and let B contain the rest of the vertices. Let Vj G A, Vji G B. Since there are 
at least [A;i/6j edge-disjoint paths connecting Sj and Sji in G, and these paths do not contain the 
terminals, there must be at least [A;i/6j direct edge-disjoint paths connecting the vertices of IJ^.g^iSi 
to the vertices of Yj^.^B ^i- Since |^| -|- \B\ = 7, at least one pair [Si, Si') with Vi G A, Vi' G B, has at 
least k' direct edge-disjoint paths connecting them. 
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Let T be any spanning tree of Z, and let 7' = 7^/^. For every pair Vi,Vj of vertices in graph Z, let 
n{vi,Vj) be the maximum number of direct edge-disjoint paths connecting Si to Sj in G. For every 
edge e = {vi,Vj) of the tree T, let w(e) = n{vi,Vj), and let tt;(T') = J2eeE(T) w{e). Root the tree T at 
any vertex. We now perform a number of iterations, that, informally, aim to maximize the number 
of leaves and the value w{T) of the tree T. In every iteration, one of the following two improvement 
steps is performed, whenever possible. 

Improvement Step 1: Let Vj be any vertex of the tree T, whose parent Vji is a non-root degree-2 
vertex. If there is any non-leaf vertex vyt^ such that Vjn is not the descendant of Vj in the tree T, and 

the edge {vj^Vjn) belongs to the graph Z, then we delete the edge (vj,Vji) from the tree T, and add 
the edge {vj,Vj") to it. Notice that this operation increases the number of leaves in T by 1. 

Improvement Step 2: Let Vj be any non-root degrce-2 vertex of the tree, whose parent Vj' is 
also a dcgree-2 non-root vertex. Let e = {vj,Vji) be the edge of tree T connecting Vj to its father. 
Consider the two connected components of T \ e, and let e' G E(Z) be the edge connecting these two 
components, with the maximum value w{e'). If w{e') > w{e), then we delete e from T and add e' to 
it. Notice that this operation does not decrease the number of leaves in T, and it increases w{T) by 
at least 1. 

We perform one of the above two improvement steps, while possible. Since each such improvement 
step either increases the number of leaves in T, or leaves the number of leaves unchanged but increases 
w{T) by at least 1, after polynomially many improvement steps, we obtain a final tree T, on which 
none of these improvement steps can be executed. Let L denote the set of leaves of T. 

We say that Case 1 happens, if T contains at least 7' leaves. Otherwise, we say that Case 2 happens. 
We consider each of the two cases separately below. Since Case 2 is easier to analyze, we consider it 
first. 

Case 2 A path P in the tree T is called a 2-path iff it does not contain the root of T, and all 
its vertices have degree 2 in T. A 2-path P is maximal iff it is not a strict sub-path of any other 
2-path. Since the number of leaves in T is less than 7' = 7^/^, the number of maximal 2-paths in T 
is at most 27'. Therefore, T contains at least one 2-path of length at least 7'. We let P' be such a 
2-path, that minimizes the size of the sub-tree rooted at the last (lowest) vertex of P' . Recall that 
1^ = 87CMG ^ [7'/^! • Let P be the sub-path of P' , consisting of the last r vertices of P'. Assume w.l.o.g. 
that P = (f 1, . . . , Vr), where Vr is the vertex that lies deepest in the tree T. We set TZ' = {Si, . . . , Sr}, 
and we will build a tree T over the set vi,. . . ,Vr of vertices as required. In fact in this case, the tree 
T will be a path, {vsi, ■ ■ ■ ,vsr)- Therefore, we only need to define, for each edge e = {vs^,vs^_^_^) on 
this path, the corresponding set Ve of paths in graph G' . 

Since the number of leaves in the original tree T is at most 7', and every 2-path in the sub-tree of Vr 
has length at most 7' (by the choice of P'), the size of the sub-tree rooted at Vr is bounded by 87'^. 

We need the following claim: 

Claim 6 For each 1 < i < r, n{vi,Vi^i) > ^4^3/2 ? '^^^^ for each 1 < < r with \i — i'\ > 1, 
n{vi,Vi>) < 

Proof: Fix some 1 < i < r, and consider the edge e = {vi,Vi^i). Notice that the size of the sub-tree 
rooted at fj+i is bounded by r -|- 87'^ < 47'^, since the size of the sub-tree rooted at Vr is bounded 
by 87'^. Let Ti,T2 be the two connected components of r\ {e}. Recall that there are at least [A;i/6J 
direct paths connecting the sets corresponding to vertices of Ti to the sets corresponding to vertices 
of T2. So there is at least one pair v G Ti, v' G T2 of vertices, with n{v,v') > ,2 = .^3/2 • Since 
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we could not perform the second improvement step, the weight of the edge {vi,Vi-^-l) must be at least 

fcl 
2473/2 ■ 

In order to prove the second assertion, consider some pair Vi,Vii of vertices on path P, where — > 1, 
and assume w.l.o.g. that Vi is a descendant of Vi'. Since we could not execute the first improvement 
step for vertex Vi, there is no edge {vi,Vi') in graph Z. Therefore, n{vi,Vi') < □ 

For each 1 < i < r, let Vi be the set of [ ^4^3/2 1 direct edge-disjoint paths, connecting Si to S'j+i. While 

the paths in each set Vi are edge-disjoint, the total congestion caused by the set \JiVi of paths may 
be as high as (r — 1). In order to overcome this difficulty, we perform edge-splitting on an auxiliary 
graph H, constructed as follows. Start with the set of all the vertices and edges lying on the paths in 
\JilZlVi. Next, for each 1 < i < r, we contract the vertices of Si, that belong to the current graph, 
into a single vertex Vi. Finally, we replace every edge by a pair of bi-directed edges. Let H be the 
final directed graph. Observe that H is a directed Eulerian graph, that has the following properties: 



• Graph H does not contain any terminals, or vertices from the sets S £ TZ\1Z' . 

• For each 1 < i < r, there are at least [ 24^3/2 1 edge-disjoint paths connecting Vi to f j+i, that do 
not contain any vertices in {^1,^2, • • . , Vr} as inner vertices. 

• For all 1 < i 7^ < r, with |i — i'| > 1, there are at most ^ paths connecting Vi to Uj/, that do 
not contain the vertices of {ui, ^2, . . • , I'r} as intermediate vertices. 



We use the following theorem to perform edge-splitting in graph H. Let D = {V, A) be any directed 
multigraph with no self-loops. For any pair {v,v') G y of vertices, their connectivity X{v,v';D) is the 
maximum number of edge-disjoint paths connecting v to v' in D. Given a pair a = {u,v), b = {v,w) 
of edges, a splitting-off procedure replaces the two edges a, 6 by a single edge {u,w). We denote by 
D"'^ the resulting graph. We use the extension of Mader's theorem |Mad78j to directed graphs, due 
to Frank jFra89| and Jackson |Jac98j . Following is a simplified version of Theorem 3 from |Jac98| : 

Theorem 11 Let D = {V,A) be an Eulerian digraph, v £ V and a = {v,u) £ A. Then there is an 
edge b= {w,v) £ A, such that for aUy,y' £ V\{v}: X{y,y';D) = X{y , y' ; D"-^) . 



We iteratively perform edge-splitting in graph H using Theorem 11, by repeatedly choosing vertices 
V {vi . . . ,Vr}, until all such vertices become isolated. Let H' denote the resulting graph, after we 
discard all isolated vertices and make all edges undirected. Then V{H') = {vi, . . . ,Vr}, and for each 
1 < z 7^ i' < r, each edge e = {vi,Vi') corresponds to a direct path Pg in graph G' , connecting some 
vertex of Si to some vertex of S^/ (that is, Pe does not contain any vertices of {\JseTZ 5') U T as inner 
vertices). Moreover, the set {Pe \ e £ E{H')} of paths causes congestion at most 1 in G', and the edges 
e' £ E{G) whose both endpoints belong to the same set Si do not lie on any such path Pe- 

We need the following claim. 

Claim 7 For each 1 < i,i' < r with \i — i'\ > 1, there are at most \^~\ parallel edges {vi,Vii) in H' , 
and for each 1 <i <r, there are at least ^^3/2 parallel edges {vi,Vi-^-i) in H' . 

Proof: Assume for contradiction that the first assertion is false, and let be a pair of vertices in 

H' with 1 < < r and |i — «'| > 1, such that there are more than \^^'\ parallel edges {vi,Vi') in 
H' . Let V be the set of paths in graph G, containing, for each such parallel edge e, the corresponding 
path Pe- Then V' contains more than \^~\ direct paths, connecting the vertices of Si to the vertices 
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of Sil in G. Moreover, the paths in V' cause congestion at most 2 in G. Therefore, there is a flow F 
between Si and 5j' of value at least ^[^1 + ^, with no congestion in graph G, where all flow-paths 

are direct paths. From the integrality of flow, there is a set of at least ^ direct edge-disjoint paths 
connecting 5, to 5j' in G, contradicting Claim [6} 

We now turn to prove the second assertion. Assume otherwise, and let 1 < i < r be some index, such 
that there are fewer than parallel edges (vj, fi+i) in graph H' . Consider the following partition 

(^4, B) of V{H'): A = {ui, . . . , Wj}, B = {vj+i, . . . , Vr}- Recall that graph H contained at least ^^^572 
edge-disjoint paths connecting Vi to fj+i. Since the edge splitting operations preserve the connectivity 
between Vi and tij/, graph H' must contain at least ^-pW edge-disjoint paths connecting Vi to fj+i. 



In particular, \Ehi{A, B)\ > ^^3/2 ■ Let E' C Eh'{A,B) contain all edges between vertex sets A and 
B, except for the set of parallel edges {vi,Vi^i). Since there are fewer than such parallel edges, 

1^ I ^ 4873/2- 

2 

Notice however that there are at most ^ — ^ pairs of vertices {vj,Vj) G AxB, where {vj,Vj') 7^ {vi, fj+i). 
Each such pair contributes at most edges to E' , from the flrst assertion. Therefore, 

\E'\< -r-l]-\:^]<-r:^< 



4 y ' 372 ' - 1272 4873/2 
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as r < [^] < ^ = a contradiction. 

Our flnal tree T is simply a path connecting the vertices {vs-i^,vs2, ■ ■ ■ ,^3^) in this order. For each 
edge e = (vs., vs^+i) on this path, we let be the set of paths in graph G' corresponding to the set 
of parallel edges connecting Vi to fj+i in graph H' . We discard paths from each set Ve until \Ve\ = ^2- 
It is immediate to verify that Uee£;(T) cause congestion at most 1 in G', and moreover, each such 
path does not contain the vertices of Us'e7^' ^ inner vertices. 

Case 1 Recall that in this case, T contains at least 7' leaves. Let TZ' Q TZ he any set of r < 7' 
vertex subsets, corresponding to the leaves in tree T. For simplicity of notation, we will assume 
that TZ' = {Si, . . . , Sr}- Our first step is to select, for each S G TZ' , a subset T(S) C out(S') of 
k^ = ^l{k'a ■ Oy^r-^/r^) edges, such that for each S,S' G TZ' , there is a set T'{S,S') : T{S) T{S') of 
paths in graph G. We do so in the following claim. 

Claim 8 There is an efficient algorithm, that either computes a {ki, a) -violating partition for some 
set S G TZ, or computes, for each set S G TZ' , a subset T{S) C out(5) of k^ = Q,{k'a • Owl/?^^) edges, 
such that for each S, S' G TZ' , there is a set T'{S, S') : T{S) -^1 r(S") of paths in graph G, that do not 
contain the vertices o/U5"e7e' inner vertices. 

Proof: Let Vj* be the root of the tree T, and let Sj* G 7^ be the corresponding vertex subset. We 
claim that for each set Sj G 7^', there is a flow Fj of value A:', connecting the vertices of Sj* to the 
vertices of Sj in G, with congestion at most ^ + 7 < such that the paths in T^j do not contain any 
vertices of Usgt^' ^ inner vertices. Indeed, consider the path {v* = vi-^^Vi^, . . . ,Vi^ = Vj) in the tree 
T, connecting Vj* to Vj. For each edge = {vi^,Vi^+i) on this path, there is a set Qz of k' direct 
edge-disjoint paths connecting the vertices of Si^ to the vertices of Si^^-^ , from the definition of the 
graph Z. Denote 

F^ = {e' G out(S'j^) I e is the first edge on some path in Qz] , 
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and similarly 



I'l^i = {e' G out(5i^_|_j) I e' is the last edge on some path in Qz} ■ 

For each 1 < z < /i, we now have r^,r^ C out(5j^), two subsets of edges of size k' < ki/2. 
From our assumption, there is a flow F'^ : F], -^i/a F^ inside set 5*2. Concatenating the flows 
{Qi, Q2, F2, ■ ■ ■ , Q/i-i), we obtain the desired flow Fj of value k' . The total congestion caused 
by paths in IJ2=i Qz is at most 7 (since /i < 7, and the paths in each set Qz are edge-disjoint), while 
each flow Fi. causes congestion at most 1/a inside the graph G[Si^]. Therefore, the total congestion 
due to flow Fj is bounded by - + 7 < - . 

J •'a I — a 

Scaling all flows Fj, for Sj £ TZ' down by factor 2r/a, we obtain a new flow F, where every set Sj £ TZ' 
sends ^ flow units to Sj*, and the total congestion due to F is at most 1. From the integrality of 
flow, there is a collection {'PjY-^^ of path sets, where for each 1 < j < r, set Vj contains [^J paths 
connecting Sj to Sj*, the paths in IJ^j^^Vj are edge-disjoint, and they do not contain the vertices of 
\Jij=i Sj as inner vertices. We will also assume w.l.o.g. that the paths in Uj=i do not contain the 
vertices of Sj* as inner vertices. 

For each I < j < r, let Ej C out(5'j*) be the set of edges e, such that e is the last edge on some path 
in Vj, and let E' = Uj=i ^j- Notice that \E'\ = r ■ [^J < ki. We set up an instance of the sparsest 
cut problem as follows. First, partition every edge e S £" by a vertex t^, and let T be the set of the 
resulting new vertices. Consider the sub-graph G* of the resulting graph induced by Sj* U T, with the 
set T of terminals. We apply the algorithm ^arv to the resulting instance of the sparsest cut problem. 
If the algorithm returns a cut whose sparsity is less than a, then we have found a (fci, a)-violating 
partition of Sj* . We return this partition, and terminate the algorithm. Otherwise, we are guaranteed 
that Sj* is awL-well- linked for E' . 

We then apply Theorem [3] to the same graph G* , with the set T of terminals, where the terminal 
subsets are defined to be Tj = {te \ e £ Ej} for I < j < r. Notice that for all j, \Tj\ = k/ poly log A:, 
while r = poly log /c and Owl = 1/ polylog/c, so the conditions of Theorem |3] hold. From Theorem [3| 
we obtain, for each 1 < j < r, a subset T- C TJ- of O ^ ^ "^"^^'^ ^ = ^3 terminals, such that G* is 1-well- 
linked for [j^T- ■ Let E'- C Ej be the set of edges corresponding to the vertices in T- , let "P' C Vj be 
the set of paths terminating at the edges of Ej, and let T{Sj) C out(S'j) be the set of edges where the 
paths in Vj start, for each 1 < j < r. Then for each 1 < j < r, |F(S'j)| = \Tj \ = k^. Moreover, we 

claim that for all 1 < < r, there is a set V{Sj,Sji) : ^{Sj) -^1 F(5j') of paths in graph G, that 
do not contain the vertices of U5e7^' ^ as inner vertices. Indeed, since Sj* is 1-well-linked for (Jj-^i! 
there is a set Vj^ji : Ej Eji of paths contained in Sj*. The set V{Sj, Sj') of paths is obtained by 
concatenating P^- , Pjjv andV'j,. □ 

Our next step is to perform edge splitting, similarly to Case 2. We build an auxiliary graph H from 
graph G, as follows. We delete from G all terminals, and for each 1 < j < r, we delete all edges in 
out(5'j) \ F(S'j). Next, we contract each set Sj into a super-node Vj, for all 1 < j < r. Finally, we 
replace every edge in the resulting graph by a pair of bi-directed edges. Let H be this final graph. 
Notice that in-degree and the out-degree of every super-node Vj, for 1 < j < r, in graph H is exactly 
k^, and for every pair Vj,v'j of such super-nodes, there are fcs edge-disjoint paths connecting Vj to 
Vji, and ^3 edge-disjoint paths connecting Vj' to Vj in H. We iteratively perform edge-splitting in 



graph H using Theorem 11 by repeatedly choosing vertices v {vi . . . ,Vr}, until all such vertices 
become isolated. Let H' denote the resulting graph, after we discard all isolated vertices. Then 
V{H') = {vi, . . . ,Vr}, and for each 1 < i ^ i' < r, each edge e = {vi,Vi') corresponds to a path 



e 



in graph G' , connecting some vertex of Si to some vertex of Sj/, and Pg does not contain any vertices 
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of Usell' inner vertices (recall that G' is the directed graph obtained from G by replacing each 
edge with a pair of bi-directed edges). Moreover, the set {Pe \ e £ E{H')} of paths causes congestion 
at most 1 in G' . For each vertex Vj in graph H' , the in-degree and the out-degree of vj are equal to 
/cs, and for each pair Vj,Vj' of vertices, there are exactly edge-disjoint paths connecting Vj to Vj' in 
H' , and k^ edge-disjoint paths connecting vj' to vj. 

We now construct a graph Z' , whose vertex set is {vi, . . . ,Vr}, and there is an edge {vi,Vj) in graph 
Z' iff the total number of parallel edges {vi,Vj) and {vj,Vi) in graph H' is at least 2k3/r^. The proof 
of the next claim is identical to the proof of Claim 3 in |Chul2| and is omitted here. 

Claim 9 There is an efficient algorithm to find a spanning tree T of maximum vertex degree at most 
3 in graph Z' . 

We output the tree T as our final tree. Notice that each edge e = {vi,Vj) G E{T) corresponds to a 
collection Ve of paths, where each path connects either a vertex of Vi to a vertex of vj, or vice versa 
in graph G' (these are the paths that correspond to the edges of H' in the original graph G' , obtained 
by the edge splitting procedure). Moreover, the paths in Ues-ECT) cause congestion at most 1 in G' , 
and these paths do not contain the vertices of Uj=i as inner vertices. The number of paths in each 
set is: 



I ^ 2/c3 „fk'a-a^n\ ^ f kia ■ a^A „ f kia ■ a^n\ , 

Ve\>^=^ J = ^ y ^2^6 J = ^ y ^3.5 J = ^2- 

□ 



6.2 Step 2: connecting the terminals 

We assume w.l.o.g. that IZ' = {Si, . . . ^Sr}- In this step we connect a subset of terminals to two 
subsets 5, S' G {5*1, . . . , Sr}, using the following theorem. 

Theorem 12 There is an efficient algorithm, that either finds a subset Ai' C M of k / poly log k 
demand pairs, and a routing of the pairs in Ai' via edge-disjoint paths in G, or finds a subset i C 7W 
0/^4 = 0(^2/^^) demand pairs, two subsets of paths in G, and for each edge e E E{T), a subset 

V'f, C Ve of [A;2/2j paths, such that: 



• We are given a partition of T[M.i) into two subsets T{,Ti, where for each pair (s,t) G -Mi, 
s £T{,t £ Ti , or vice versa. 

• There are two sets S,S' G TZ' (with possibly S = S'), such that V[ : T{ out(5), Vi : TC 
out(5'). 

• The paths in V[ U Vi U ^Uee-B(T') ^ej '^^ ""^^^ contain any vertices of Uj=i ^3 inner vertices, 
and they cause congestion at most 2 in G. 



Proof: 

We start with the following lemma, whose proof uses standard techniques. 

Lemma 1 Let S € TZ be any set. Then there is an efficient algorithm, that either finds a subset 
M' C of k/ poly log demand pairs and a routing of M' via edge-disjoint paths in G, or finds a 
subset C Ai of Q{ki) pairs, and a set V : T{Mo) -^i out(S') of paths in graph G, connecting 
every terminal in T{Mo) io some edge of out (S) via edge-disjoint paths. 
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Proof: Recall that we have assumed that there is a flow F of value ki/2 and no congestion in graph 
G between out(5) and T. From the integrality of flow, there is a set V* of [fci/2j edge-disjoint paths 
connecting the terminals in T to the edges in out(/S'). Recall that the degree of every terminal in 
G is 1, so every terminal is an endpoint of at most one path in V* . Let To CI 7~ be the subset of 
[/!;i/2j terminals that serve as endpoints of paths in V* . We partition the remaining terminals into 
h = \ |^fcjY2j "I ~ 0(7^ log 7) subsets of at most [fc/2j terminals each. Since the set T of terminals is 
1-well-linked, for each 1 < j < /i, there is a flow J^j : 7j -^1 To- Concatenating flow Fj with the paths 
in V* , and taking the union of all resulting flows over all 1 < j < h, we obtain a flow F' , where every 
terminal in T sends one flow unit to some edge in out (5), and the congestion caused by F' is at most 
2h = 0(73 log 7). 

Our next step is to perform a grouping of the terminals, using Theorem[2j with the parameter q = 20h. 
Let Q be the resulting partition of the terminals. Recall that each set U £ Q contains at least q and 
at most 3q terminals, and it is associated with a tree Tfj, containing the terminals of U, such that the 
trees {Tu}^^g are edge- disjoint. 

We partition Ai into two subsets: M'l containing all pairs (s, t) where both s and t belong to the 
same group U £ G, and containing all remaining pairs. 

Assume first that \Mi\ > |7W|/2. We then select a subset M' C J\A'i of demand pairs as follows: for 
each group U £ Q, we select one arbitrary pair (s,t) G A4'i with s,t £ U, if such a pair exists, and 
add it to A4' . Notice that since every group contains at most 6O/1 terminals, the number of pairs in 
M' is at least = = ^ ( poiylogfc )- Every pair {s,t) £ M[ can be connected by some 

path contained in the tree Tu, where s,t £ U. Since the trees {T[/}c/gg are edge-disjoint, the resulting 
routing of the pairs in Ai' is via edge-disjoint paths. We assume from now on that |A^2| > |A^|/2. 
We need the following simple claim. 

Claim 10 We can efficiently find a subset M." C M.'^ of at least A;i/960 demand pairs, such that there 
is a flow F* : T{M") ^i.i out(5) in G. 

Proof: We start with M" = 0. While M2 ^ 0, select any pair (s, t) £ M'2 and add it to M" . Assume 
that s £ U , t £ U' . Remove from all pairs {s' ,t'), where either s' or t' belong to C/ U U' . Notice 
that for each pair added to M." , at most 6q = 120/i pairs are deleted from A^2- Therefore, at the end 
of this procedure, \M"\ > 2!)^ > 4 " it = Ife- 

We now show that there is a flow F* : T{M") -^1,1 out(S'i) in G. Let F" be the flow F', scaled 
down by factor 20h. Then every terminal in T sends l/(20/i) flow units to the edges of out(S'), and 
the total congestion caused by F" is at most 1/10. In order to define the flow F*, consider some 
terminal t £ T{M."), and let Ut £ G he the group to which it belongs. Let C Ut he any subset of 
20h terminals in Ut- Terminal t then sends l/(20/i) flow units to each terminal in Uj-, inside the tree 
T[/j. This flow is then concatenated with the flow that leaves each terminal in Uj- in F" . Taking the 
union of this flow for each t £ T{M"), we obtain a flow F* , where every terminal in T{Ai") sends 
one flow unit to the edges of out (5). In order to bound the edge congestion, we observe that flow 
F" contributes at most 1/10 congestion to every edge. Since the connected components {Tc/t}tgr(A^") 
are edge-disjoint, the flow inside these components contributes at most 1 to the total edge congestion. 
Overall, the congestion caused by the flow F* is at most 1.1. □ 

From the integrality of flow, there is a collection V of L^xi J > 1-8|7W"| of edge-disjoint paths 
connecting the terminals in T{Ai") to the edges of out(S'). Let T' C T{M.") be the set of terminals 
where these paths originate, and let A^o ^ -M." he the subset of the demand pairs contained in T' ■ 
Since \T'\ > 1.8|A^"|, we get that |A^o| > \M"\/2. Thus, we have obtained a set Mo of il,{ki) demand 
pairs, such that all terminals in T{Aio) can be connected to the edges of out(5) via edge-disjoint 
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paths. 



In order to complete the proof of Theorem 12, we start by applying Lemma [T] to the set S = Si. 



If the outcome is a subset M' Q Ai of k/ poly log A: demand pairs and a routing of pairs in M' via 
edge-disjoint paths in G, then we return Ai' together with this routing, and terminate the algorithm. 

We therefore assume from now on, that Lemma [T] returns a set A^o ^ of ^{ki) pairs, together with 
the set V : T(A^o) "^i out(S'i) of edge-disjoint paths in graph G. We discard terminal pairs from Aio, 
and the corresponding paths from V until \M.o\ = L^2/4J, so \V\ = |T(A^o)| < ^2/2 holds. 

For each path P G V, we direct the path from the terminal towards ^i, and we truncate it at the first 
vertex that belongs to IJi=i ^i- '^^^ collection V of paths then connects every terminal in T{Mo) 
to some vertex of IJi=i '^i- Moreover, the paths in V do not contain the vertices of IJi=i '^i ^ inner 
vertices. 

We now combine the two sets of paths, V and Uee-B(f') to ensure that they cause congestion at 
most 1 in G' altogether, using the following simple claim. 

Claim 11 We can efficiently find, for each edge e G E{T), a subset "Pg C Ve of [k2/2\ paths, and a 
collection V' : T{A4o) '^i Ui=i of paths in G' , such that the paths in set (^Uee£;(T) U ^' cause 
congestion 1 in G' . 



Proof: The proof is very similar to the arguments used by Conforti et al. |CHR03j . We re-route the 
paths in V via some paths in Ue6£;(f') setting up an instance of the stable matching problem in 

a multi-graph. In this problem, we are given a complete bipartite multigraph G = {A,B,E), where 
|j4| = \B\. Each vertex v G AUB specifies an ordering Ry of the edges adjacent to v in G. A complete 
matching M between the vertices of A and B is called stable iff, for every edge e = {a,b) £ E \ M, the 
following holds. Let Ca, be the edges adjacent to a and h respectively that belong to M. Then either 
a prefers Ca over e, or b prefers over e. Conforti et al. |CHR03j . generalizing the famous theorem 
of Gale and Shapley |GS62j . show an efficient algorithm to find a complete stable matching M in any 
such multigraph. 

Let Q = Uee-E;(T) ™ instance of the stable matching problem as follows. Set A contains 

a vertex v{P) for each path P G V, and set B contains a vertex v{Q) for each path Q € Q. In order 
to ensure that |^| = \B\, we add dummy vertices to A as needed. 

For each pair P £ V, Q £ Q of paths, for each edge e that these paths share, we add an edge 
{v{P),v{Q)), and we think of this edge as representing the edge e. We add additional dummy edges 
as needed to turn the graph into a complete bipartite graph. 

Finally, we define preference lists for vertices in A and B. For each vertex v{P) £ A, the edges incident 
to v{P) are ordered according to the order in which they appear on the path P. The dummy edges 
incident to v{P) are ordered arbitrarily at the end of the list. For each vertex v{Q) G B, its adjacent 
edges are appear in the reverse order of their appearance on the path Q, with the dummy edges 
appearing in the end of the preference list. The preference lists of the dummy vertices are arbitrary. 

Let M be a stable matching in the resulting graph G. For each e G E{T), we let V'^ C be the set 
of paths Q, whose vertex v{Q) is matched to a dummy vertex in M. Since \V\ < k2/2 < |'Pe|/2, this 
ensures that \T"J > /C2/2. 

In order to construct the set V' of paths, consider some path P €z V. If P participates in the matching 
M via a dummy edge, then we add P to V' . Otherwise, assume that {v{P), v{Q)) G M, and the edge 
{v{P),v(Q)) corresponds to some edge e shared by both P and Q. We then replace P with a path P' 
that consists of two subpaths: a segment of P from the beginning of P until the edge e, and a segment 
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of Q from the edge e to the end of Q. Notice that P' starts at the same terminal as P, and terminates 
at some vertex of U[=i '^i- Path P' is then added to set V. 

This finishes the definition of sets V, {'Pe}eeB(f) paths. Observe that the paths in V connect every 
terminal in T{M.o) to some vertex of U5=i and they do not contain the vertices of Uj=i '^j as inner 
vertices. It now only remains to show that the set ([jeeE{f) ^Cj ^ ^' paths causes congestion 1 in 
G'. 

Clearly, the paths in Uee£'(f ) are edge-disjoint from each other in graph G', since the paths in 
Uee-B(T) were edge-disjoint from each other. 

Assume now for contradiction that some pair of paths P' , P" G V' share an edge. Assume w.l.o.g. that 
P' was obtained from concatenating a segment of some path Pi &V and some path Qi & Q via some 
edge ei (where possibly Qi is empty and P' = Pi), and similarly P" was obtained from concatenating 
a segment of some path P2 and some path Q2 & Q via some edge 62- The paths P', P" may share 
an edge only if the first segment of P' shares an edge with the second segment of P" , or vice versa. 
Assume w.l.o.g. that it is the former, that is, there is an edge e', that appears on Pi before ei, and e' 
appears on Q2 after 62 (notice that it is impossible that e' = ei or e' = 62, since Qi, Q2 cannot share 
an edge). But then both Pi and Q2 prefer the edge e' to their current matching in M, contradicting 
the fact that M is a stable matching. 

Finally, if some pair of paths P' G V' and P" G UeeE(f) share an edge, we reach a contradiction 
exactly as before. □ 

Let To = T{Mo). Let S ^ TZ' he the set such that at least iTol/?' paths in V' terminate at S. Let 
C V' be the subset of paths that terminate at S, and let CI To be the subset of terminals 
where these paths originate. Let M.^ C M.q be the subset of pairs (s,t) where both s,t G T^, and let 
M"^ C A^o be the subset of pairs (s,t) where s eT^, t ^T^. 



Assume first that \M.^\ > \T^\/4:. Then M.^ contains Q, > k4 demand pairs, and we have a set 

: T{M^) 5 of paths connecting the terminals participating in pairs in to the vertices of 

S. We discard arbitrary pairs in A4^, until \A4^\ = holds, and we discard the corresponding paths 
from V^. We then partition the resulting set T{Ai^) of terminals into two arbitrary subsets, Ti,Ti, 
such that for each demand pair (s, t) G A^^, exactly one terminal belongs to each of the subsets. This 
defines a corresponding partition {Vi,Vi) of the set of paths. We set S = S' in this case. 

From now on we assume that |A^^| < |T^|/4, and so \M'^\ = O (^'f^ . Let = {i | (s, t) G M'^, s G T^}, 

and let C P' be the subset of paths originating at the terminals of 7^. Let S' G TZ' be the set 
where at least \T^\/r of the paths in V'^ terminate. We let V" C P'^ be the subset of paths terminating 

at the vertices of S", and we let T{' C be the subset of terminals where these paths originate. We 
let T^' = {s I {s,t) G G Tl'}, V[ C the set of paths originating at the terminals of T{, and 

Ml C ^]^g Qf pg^jj.g ^g^^^ ^j^]^ s eT{, t e Ti' . Notice that |A^i| = = A;4 as required. □ 

Let V and v' be the vertices corresponding to the sets S and S' respectively in the tree T. Assume first 
that V ^ v' . Since the maximum vertex degree in T is at most 3, if we remove v and v' from T, we 
obtain at most 5 sub-trees. Let Tq denote the sub-tree incident on both v and v\ Ti,T2 the sub-trees 
incident on v, and T3,T4 the sub-trees incident on v' (possibly some of the sub-trees are empty). Let 
T G {Tl, . . . jT^} be the sub-tree containing the most vertices, and assume w.l.o.g. that T = Ti, so it 
is incident on v. Let T* be the tree obtained from T, by deleting the sub-trees T2, T3, T4 from it. It is 
easy to see that T* must contain at least r/4 > 27cmg vertices, and it contains both vertices v and v' . 
Moreover, the degree of v is at most 2, and the degree of v' is 1 in T*. We will view v as the root of 
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If V = v', then we root the tree T at vertex v, and consider the sub-trees rooted at the children of 
V. We delete all but one of these sub-trccs, leaving the sub-tree containing the most vertices. We 
denote by T* the resulting tree. In this case, the degree of f is 1, and T* again contains at least 27cmg 
vertices. 

Let TZ" C TZ' be the collection of sets Sj whose corresponding vertex vj G V{T*), so \TZ"\ > 27cmg- 

For each terminal t € T{Mi), let Pt & V[U V'l be the unique path originating at t. Recall that every 
edge e G out(S') U out(S") may participate in up to two paths in V'l U (and in this case S = S' 
must hold, and e must be the last edge on these paths). It will be convenient to ensure that each such 
edge participates in at most one such path. In order to achieve this, we construct two new subsets 
M[,M'( C Ml of demand pairs, as follows. Start with M[ = M'{ = 0. While Mi / 0, select any 
pair (s, t) G Mi, and delete it from Mi. Let e be the last edge on Pg, and let e' be the last edge on P(. 
If e = e', then we add pair (s, t) to M'l, and continue to the next iteration. Otherwise, we add (s, t) to 
M'l, and we delete from Mi every pair (s', t') G Mi, where either or P^i contain either of the edges 
e or e'. If, at the end of this procedure, \M'i\ > \Mi\/2, then we can route every pair G M'l by 
concatenating the paths Ps and Pt. This gives a routing of the paths in M'l with congestion at most 
2. Prom now on we assume that \M.'i\ < \Mi\/2. Observe that for each pair that we add to M", we 
delete at most three pairs from Mi- Therefore, \Mi\ > |A^i|/6. We delete demand pairs from M", 
until \M'(\ = [k4/6\ holds, and we delete from , Ti all terminals that do not participate in the pairs 
in M'(, and we delete their corresponding paths from V'l and P^'. For convenience, we will refer to 
M'l as Ml from now on. Observe that now every edge in out(S') U out(S") participates in at most one 
path in r[ U T^f . 

To summarize this step, we now have a tree T* with maximum degree 3, and a subset TZ" C 7?, of at 
least 27cMG vertex subsets, such that V{T*) = {vj \ Sj G TZ"}. Tree T* contains two special vertices, 
vertex v corresponding to the set S G TZ" - the root of the tree, whose degree is at most 2, and vertex 
v' corresponding to the set S' G TZ", whose degree is L It is possible that S = S' and so v = v'. 
For each edge e = {vi,Vj) G E(T*), we are given a set "Pg of \_k2/2\ paths in graph G, connecting 
the vertices of Si to the vertices of 5*^. We are also given a subset A^i of [^4/6] < [^2/2] demand 
pairs, and two subset T{,T" of terminals, such that for each pair {s,t) £ Mi, s & T{, t G T" or vice 
versa. Additionally, we have two sets V[ : T[ ^ out(S'), T'" : T" ^ out(S") of paths. The paths in 

P( U P(' U ^Uee£;(T*) ^Cj do not contain the vertices of Us"e7e" ^ inner vertices, and the total 
congestion they cause in graph G is at most 2. Each edge in out(S') U out(S") participates in at most 
one path in P^ U P^'. 

6.3 Step 3: Building the Good Crossbar 

We assume w.l.o.g. that TZ" = {Si, . . . , Sr"}, where r" > 27cmg. We say that a path P in graph G is 
good iff it does not contain the vertices of TU (^Uj=i '^j^ inner vertices. Consider some set Sj G TZ" , 
and let P be any collection of good paths in G. We denote by Tj{T^) C out(5j) the multi-set of edges, 
that appear as the first or the last edge on any path in P. That is, 

rj(P) = {e G out(S'j) I e is the first or the last edge on some path P G P} 

Our first step is to select, for each edge e G E{T*), a large subset Pg C P^ of paths, and two large 
subsets P2 C P(,P2 C p(' of paths, such that, on the one hand, for each I < j < r" , set Sj is 1-well- 

linked for the subset Tj{T'') of paths, where P' = P2 U P2 U ^UeeE(r*) ) > while on the other hand 
the endpoints of paths in Pg and P2 form source-sink pairs in set Mi- This is done in the following 
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theorem. 



Theorem 13 There is an efficient algorithm, that either computes a subset Ai' C J\Ai ofk/ poly log A; 
demand pairs and their routing with congestion at most 2 in G, or a (fci, a) -violating partition of some 
set Sj G TZ" , or it computes, for each edge e G E(T*), a subset "P" C V'^, of = $7(0^,^ ^2) paths, and 
two subsets V2 C V[,V2 C Pf, such that: 



• Each set Sj G TZ" is 1-well-linked for the set Tj{V') of edges, where V' = V2^V2^ I UeG£;(r*) 



• There is a subset M2 ^ Mi of Q.{a\,^ • \M.i\ = Q.{a\,^ ■ k^) demand pairs, and two subsets 
T2 C 77,7^" ^ Ti of terminals, such that for each pair {s,t) £ M.2, s G 7^',t G 7^" or vice 
versa. Moreover, V'2 connects every terminal 0/ 7^' to some edge in out(S'), and V2 connects 
every terminal o/7^" to some edge in out(5"). 



such set Sj, we compute the instance SC{G, Sj,T j) of the sparsest cut problem, where Tj = Tj{V), 
and run the algorithm .4arv on it. Let {A, B) be the output of the algorithm. If the sparsity of the cut 
{A, B) is less than a, then [A, B) defines a (fei, a)-violating partition of Sj. We then stop the algorithm 
and return this partition. Otherwise, we are guaranteed that set Sj is q;/q;ahv = OwL-well- linked for 



From now on, we assume that for each 1 < j < r" , set Sj is awL-well-linked for Tj{V). Our next step 
is to boost the well-linkedness of every subset Sj to 1, by selecting subsets of paths in each set (and 
in sets V[,Vi). This is done by iteratively applying Theorems [s] and [4] to the sets Si, ... , Sr". 

We start with the case where v' = v. We process the vertices of the tree T* in the bottom-up fashion. 
Let Vj be some vertex of T* . We maintain the invariant that once vertex Vj is processed, for every 
edge e in the sub-tree of vj, we have computed a subset V" C "P^ of at least k^ paths, such that the 
following holds. Let Q denote the union of all paths in the sets V'J, where e belongs to the sub-tree 
of Vj. Then for any vertex Vji in the sub-tree of Vj, set Sj' is 1-well-Iinked for Tj'{Q). Additionally, 
if e is the edge connecting Vj to its parent, then we have also computed a subset Pe ^ of at least 
k = 0(awL^2) paths, such that set Sj is 1-well-linked for the set Tj{Q U Ve) of edges. 

We now show how to process each vertex, so that the above invariant is preserved. Assume first that 
Vj is a leaf vertex, and let e be the unique edge of T* incident to Vj. Let F = Tj{Ve)- Consider the 
instance SC(G, Sj,T) of the sparsest cut problem. Recall that it is defined on a graph Gj(F), with a set 
T(F) of terminals, where each terminal te G 7'(F) corresponds to an edge e G F. We apply Theorem [s] 
to graph Gj{T) with a single set T(F) of terminals. Since Sj is awL-well-linked for F, graph Gj{T) is 
OwL-well-linked for 7~(F), and so we will obtain a subset 7~'(F) C 7~(F) of r2(awL^2) ^ k vertices. Let 
T'j C F be the subset of edges corresponding to the vertices in 7''(F). We define Pe ^ to be the 
subset of paths whose last edge belongs to F^-. Notice that each edge e' G F^- may serve as the last 
edge for at most two paths in T"^. If two such paths exist, then only one of them is added to Pe- This 
defines the required set Pe, with \Pe\ > k. Observe that Sj is 1-well-linked for Tj{Pe). 

Consider now some non-leaf vertex Vj 7^ v. We assume that Vj has two children. The case where Vj 
has one child is handled similarly. Assume that e is the edge connecting Vj to its parent, and 61,62 
are the edges connecting Vj to its two children in T* . We define three subsets of edges in out(S'j): Fq, 
containing all edges of out(5j) that belong to paths in T"^, and Fi, F2, containing the edges of out{Sj) 
that belong to the paths in Pe^ , Pe2 > respectively. Let F = F0UF1UF2. Recall that Sj is OwL-well-linked 
for F. As before, we build an instance SC(G, Sj,T) of the sparsest cut problem, with the graph Gj{T) 
and the set 7'(F) of terminals. Let 7o(F), 7i(F), 72(F) C 7'(F) be the subsets of these new terminals 




Proof: We start with V = UP('U MJeeEfT*) 



and we consider the sets Sj one-by-one. For each 
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corresponding to the edges in the sets ro,ri, and respectively. Then Gj{T) is awL-weU-hnked for 
^(r), and moreover 7o(r) > [k2/2\ and 71(r), 72(r) > k/2 (since each edge in out(5j) may participate 
in up to two paths in V). We apply Theorem [s] to graph Gj{T) and three subsets 7o(r), 7i(r), 72(r) 
of terminals. Let Tq{T) C 7o(r), T{{T) C 71 (F), and 7^'(r) C 72 (r) be the output of the theorem. 
RecaU that \Tq{T)\ > n{a^j^k2) > k, while |7;'(r)|, |7^'(r)| > n{a^nk) = n{al,^k2) > k^. Moreover, 
graph Gj(T) is 1-well-linked for 7^'(r) UT{(T) U7^'(r), and these vertex sets are mutually disjoint. As 
before, we let Fq C Tq be the subset of edges corresponding to the vertices of Tq{T), and we let T[,T2 
be the subsets of edges corresponding to the vertices of T{{T) and 7^'(r), respectively. Finally, we set 
Ve ^ T^e be the subset of paths that contain an edge in Fq (if some edge in Fq is contained in two such 
paths, only one such path is added to Pe)- We define T"J_^ C p^^ and VJ^ C p^,^ similarly, using the 
sets Fi,F2 of edges. Observe that \Pe\ > k, and {Veil, \'Pe2\ ^ ^5 ^ required. It is easy to verify that 
the invariant is preserved. 

In our final step we process the root vertex v = vj. Recall that since we have assumed that v = 
v', vertex v has degree 1. Let e be the unique edge incident on v, and let Sj S TZ" be the set 
corresponding to v. We again define three subsets Fo,Fi,F2 ^ ont{Sj) of edges, where Fq contains all 
edges participating in the paths in Ve, and Fi and F2 contain all edges participating in the paths in 
V'l and Vi, respectively. Notice that Aii defines a matching Ai over the set Fi U F2 of edges, where 
for each pair (s,t) G -Mi, if P,P' G "Pj U "Pf are the two paths originating at s and t respectively, 
and e',e" are the last edges on these two paths, then (e, e") G A^. As before, we set up an instance 
SC(G, 5j,F) of the sparsest cut problem, with graph Gj(T) and the set T(F) of terminals, where 
F = Fq U Fi U F2. Let 7o(F), 71(F) and 72(F) be the subsets of terminals in T(F) corresponding to 
the edges in sets Fq, Fi, and F2, respectively. We apply Theorem [4] to graph Gj{T), where the first 
set of terminals is 7i(F) U 72(F), together with the matching A4, while the second set of terminals 
is To{r). Let 7^'(F) QTo{T), 7]'(F) C Ti{T), and 7^'(F) C 7i(F) be the output of the theorem, and 
let Ai' C be the corresponding matching over the vertices of 7'{(T) U 7^'(F). As before, we define 
subsets Fq,F'^,F2 of edges corresponding to the terminal sets 7c[(F), 77(F) and 7^'(F), respectively. 
The subset C p^ of paths is defined exactly as before, from the subset Fq of edges. 

Finally, let V2 ^ V[,V2 ^ V'l be the subsets of paths containing edges in F'^ and F2, respectively 
(recall that each edge in F'^^ U Fg may belong to at most one such path) . Let 7^' C 7^ be the subset 
of terminals where the paths of V2 originate, and similarly let 72" ^ 7^" be the subset of terminals 
where the paths of V'-^ originate. Let A^2 = {(s, t) G A^i | s G 7^', t G 7^"}. From the above discussion, 
IAI2I = l-^'l ^ ^^(awL)|-A4i|. This finishes the proof of the theorem for the case where v = v' . 

Assume now that v ^ v' . The algorithm is exactly as before, except that the vertices v and v' are 
processed diff'erently. In order to process vertex v' , assume that v' = vj, and let e be the unique edge 
incident on v' in tree T* . We consider the two sets V'l, V{e) of paths, and the two corresponding 
subsets Fi, F2 C out{Sj) of edges, that lie on the paths in "P" and V'{e), respectively. We then process 
vertex v' as any other degree-2 inner vertex in tree T* , using the sets V'^, V{e) of paths, and the 
corresponding subsets Fi, F2 of edges. The output of this iteration is a subset V[e) C 'P'(e) of at least 
k paths, and a subset P" C Vl of at least f](awL)|-A^i| paths, such that set Sj is 1-well-linked for 
T,{P';uP{e)). 

Let 7j" C 7j" be the subset of terminals t, where Pt G V'l- Next, we discard from Mi all pairs 
(s,t) that do not contain a terminal in P"", obtaining a new set Ai C Aii of demand pairs. We also 
construct a set 7^ ^ 7^ of terminals, that participate in the pairs in M., and a set P[ C P[ of paths 
originating from these terminals. 

Vertex v is processed as follows. Let e, e' be two edges adjacent to v in T* (the case where the degree 
of u is 1 is handled similarly). We process vertex v exactly like all other inner vertices of tree T* , 
with the corresponding three subsets of paths: P(e),P(e'), and Pj. As a result, we obtain three new 
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subsets of paths: V"{e) Q V{e) and V"{e') C V{e') containing at least paths each, and a subset 
^2 ^ V'l of il(Q|,L)|-^i| paths. Let 7^' C be the subset of terminals t where Pt G ^^"^ 
A^2 ^ -A^i be the set of all pairs that contain a terminal in T^- We set ^ 7j" be the set of 
all terminals participating in pairs in A^2) and we let V2 ^ "P" be the set of paths originating at the 
terminals of 7^2- ^ 

Let fee = |A^2| = f^(awL)^4- Notice that fee < ^5- For each edge e G E{T*), while |P"(e)| > /ce, we 
discard arbitrary paths from V"{e), until |'P"(e)| = fcg holds. We now assume that |'P"(e)| = kg for all 
e e £;(r*), and recall that |p2M^2 I = 

We are now ready to define the good crossbar in graph G. Let S* contain all sets Sj, where the degree 
of vertex Vj in tree T* is either 1 or 2 (excluding the set S corresponding to the root vertex v). Notice 
that at least half the vertices of T* must have this property, and therefore, jS**] > 7cmg- If 15**1 > 7cmg) 
then we discard vertex subsets from S* arbitrarily, until 15**1 = 7cmg holds. 

Consider some set Sj £ S* , and let e be any one of the edges adjacent to the vertex vj in tree T* . We 
then define T*j C out(5j) to be the subset of edges that participate in the paths in V"{e). Recall that 
from Theorem 
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set Sj is 1-well-linked for T*-. 



Consider any set Sj S IZ" . If the degree of Vj in tree T* is 2, then let e, e' be the two edges incident to 
Vj in r*, and let F, T' C out(5j) be the sets of edges lying on the paths P"(e) and V'^e'), respectively. 
Since Sj is 1-well-linked for F U F', we can find a set Qj : F T' of paths contained in Sj. 

Assume now that the degree of Vj in tree T* is 3 (in this case, Sj 5*), and let e,e' and e" be the 
three edges incident to Vj in T*, where e connects Vj to its parent. Let F, F', F" C out(5j) be the sets 
of edges lying on the paths in V" {e),V" {e'), and P"(e"), respectively. Since Sj is 1-well-linked for 
F U F' U F", we can find two sets Q] : F i^i F' and ; r i^i F" of paths inside Sj . Let Qj = Q] U Q|. 

Assume now that v = v' , and let 5 be the set corresponding to the vertex v. Let e be the unique 
edge incident on v in tree T* , and let F C out (5) be the subset of edges lying on the paths in V"{e). 
Similarly, let F' C out (5) be the subset of edges lying on the paths in V2 U 7^2- We can then find a 
set Q{v) : F i^i F' of paths contained in set S. 

We are now ready to complete the construction of the good crossbar for the case where v = v' . We 
let M* = Ai2- For each terminal t G T{M*), we construct a tree Tt, which is then added to r*, as 
follows. We start with the path Pt £ V2 > ^-^^ let e be the last edge on this path. We then add to 
Tt the unique path in Q{S) that originates from e. Let e' be the last edge on this path, let e' be the 
edge of T* connecting v to its child, and let P be the unique path in V^, originating at e' . We then 
add P to the tree Tt. 

In general, we process the vertices of the tree T* in the top-bottom manner. We maintain the invariant 
that when a vertex Vj is processed, for each edge e adjacent to Vj in tree T*, one path of V'^ has been 
added to tree Tt. Vertex Vj is processed as follows. Let e be the edge connecting Vj to its parent, and 
let P G VJ be the path that belongs to tree Tt. Let e C out(5j) be the last edge on path P. If the 
degree of Vj is 2 in T*, then we add to Tt the unique path in Qj that originates at e. Let e' be the 
last edge on this path, and let e' be the edge in T* connecting Vj to its child. We add the unique path 
in T"J, originating at e' to the tree T. 

Otherwise, if the degree of Vj is 3, then we add to T two paths: Pi G Qj, and P2 € Q|, that originate 
at e. Let e',e" denote the two edges of T* connecting the vertex Vj to its two children. As before, we 
add a unique path P3 G V'J, , originating at the last edge of Pi , and P4 G V'J,, , originating at the last 
edge of P2 to tree Tt. 

If the degree of Vj is 1, then we do nothing. Once all vertices of T* are processed, we obtain a tree 
Tt, containing the terminal t, and for each set Sj G S* , a unique edge in F*. We let r* denote the 
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collection of all trees Tj, for all t G T{M*). It is immediate to verify that the trees in r* cause 
congestion at most 2 in graph G. Moreover, since the sets Sj G S* correspond to vertices of degree 1 
or 2 in r*, each edge of G[Sj] for such sets Sj belongs to at most one tree. 

Assume now that v ^ v' . Let e, e' be the two edges incident to the vertex v in tree T* (if the degree 
of V is 1, the argument is similar). As before, we denote by r',r" C out(S') the subsets of edges that 
lie on the paths in V'J, V'J,, respectively, and we let T C out(S') denote the subset of edges lying on the 
paths in V!^- We then compute two sets Qi{S) : F i^i F' , and 22(5') : F iii F" of paths contained in 
set S. 

Let e be the unique edge incident on the vertex v' in tree T*, and let F C out(>S") be the subset of 
edges participating in the paths in V". Let F' C out(S") be the subset of edges participating in the 
paths in 1^2- We compute a set Q{S') : F F' of paths contained in S' . 

For each terminal t G T{M.2), we now build a tree T^. This tree is constructed exactly as before, and 
it contains the path Pf & V2, and some additional path P G "Pg- before, for each edge e G T*, 
the tree contains exactly one path in VJ. Notice that now each tree T G r* contains two terminals: 
one in and one in 72"- We partition M2 into two subsets, M2 and M2, where M2 contains all 
pairs {s,t) where both s and t belong to the same tree in r*, and contains all remaining pairs. 
If |A^2l ^ |-^2|/2, then we can route every pair (s,t) G Ai2 along the tree T E t* containing s 
and t. This gives a routing of the pairs in M2 with congestion at most 2. We now assume that 
\M'^\ > \M2\/2. 

We construct the set M* C M2 of demand pairs as foUows. Start with M* = 0. While M2 / 0, select 
any pair {s,t) G move it to Ai*. Let Ts,Tt G r* be the trees in which s and t participate, 

respectively. Remove from Ai* all pairs (s',t'), where either s' or t' participate in either Tg or Tt. 
Observe that for each pair that we add to M*, at most three pairs are removed from M2- Therefore, 
\M*\ > |A^2|/3 > IAI2I/6 > ke/G. Finally, we remove from r* all trees that do not contain terminals 
in T(7W*). Observe that now each tree in r* contains exactly one terminal in T{M*)- For each set 
Sj G 5*, we also discard from F* all edges that do not participate in trees in r*. Observe that for 
each Sj G S*, we now have |r*| = |r*| > ke/G. Setting k* = |A^*| = |r*| finishes the construction. 
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A Proof of Theorems [s] and ^ 



We start with several definitions and observations that are common to the proofs of both theorems. 
Both proofs closely follow the arguments of |CKS04b] . Assume that we are given a connected graph G 
with maximum vertex degree at most 4, and a set T ^ ^(G) of terminals, such that G is a- well- linked 
for T, for some a < 1. Let q be the smallest even integer with q > A/a, so A/a < q < 8/a. We assume 
that \T\>q. 

We use the following simple theorem, whose proof is very similar to the proof of Theorem [2| to find 
an initial clustering of the vertices of G. 

Theorem 14 There is an efficient algorithm that finds a partition C of the vertices of G, where each 
set G €z C contains at least q and at most Aq terminals. Moreover, each cluster G £ C induces a 
connected component in G. 

Proof: Let T be any spanning tree of G. Our algorithm is iterative. We start with C = 0. During the 
algorithm, we remove some vertices from T, and we maintain the invariant that T contains at least q 
terminals. 

If T contains at most 4g terminals, then we add all the vertices of T into C as a single cluster, and 
terminate the algorithm. Otherwise, we find the lowest vertex v in T, such that the sub-tree rooted at 

V contains at least q terminals. Notice that since we have assumed that the maximum vertex degree in 
G is 4, V has at most three children, and the subtree of each such child contains fewer than q terminals. 
Therefore, the total number of terminals contained in the subtree of v is at most 3q. We add all the 
vertices in the sub-tree of f as a new cluster to C, and remove them from T. Notice that since we 
assumed that T contained at least Aq terminals at the beginning of the current iteration, it contains 
at least q terminals at the end of the current iteration. We then continue to the next iteration. □ 

Let Ti be the set of all sub-graphs induced by the vertices in the subsets in C, that is, 71 = {G[G] \ C £ C} 
We will refer to the sub-graphs H £ % as clusters. Notice that all clusters in % are vertex disjoint, 
and every vertex belongs to at least one cluster. For each cluster H £ T-L, we let T{H) C 7" be the 
subset of terminals contained in H, and for each terminal t £ T, we let H{t) £ % denote the unique 
cluster to which t belongs. 

Let E' C E{G) be the set of all cut edges of graph G (recall that e £ E{G) is a cut edge iff its removal 
disconnects the graph G). We need the following definition. 

Definition 6. Let H be any sub-graph of G, T{H) any subset of terminals contained in and let 

V £ T{H). We say that u is a center of H, iff v can send one flow unit to the vertices of T{H) in 
graph H, such that every vertex of T{H) receives at most 1/q flow units, and the flow on every edge 
e £ E{H) is at most 1/2. 

We say that v \s a, pseudo-center of iff v can send one flow unit to the vertices of T{H), with the 
same restrictions as above, except that the edges e £ E' are allowed to carry up to one flow unit. 

We need the following claim. 

Claim 12 Assume that we are given a collection Q o/ edge-disjoint sub-graphs ofG, a subset T{H) C 
T of terminals for each sub-graph H £ Q , and a subset T' of terminals, such that: 

• ForH^ H', r{H) n T{H') = 0. 

• For each t £ T' , there is a cluster H{t) £ Q , such that t £ T{H). Moreover, t is a pseudo-center 
ofH. 
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• For each H eG, \T{H) n T\ < 1. 
Then G is 1 -well-linked for T' . 

Proof: Let [A, B) be any partition of V{G). Denote Ta = ACi T' , Tb = B Ci T' , and assume w.l.o.g. 
that \Ta\ < \Tb\- In order to show that \E{A,B)\ > \Ta\, it is enough to prove that there is a flow 
F : Ta ^1 Tb G . Let '^Tb he any subset of terminals of size \Ta\- 

We compute the flow F in two steps. In the first step, we find a flow F' ■.Ta~~^ Tg, where the flow on 
every edge e E E{G) \ E' is at most 1, and the flow on every edge e G £" is at most 1.5. In the second 
step, we transform F' into the desired flow F : Ta '^i Tb- 

We start by showing the existence of the flow F' . Recall that each terminal t € T' is associated 
with a cluster H{t), and there is a flow Ft inside H{t), where t sends one flow unit to the vertices 
of T{H(t)), and each such vertex receives at most 1/q flow units. The congestion due to flow Ft on 
every edge e G E{G) \ E' is at most ^, and the congestion on every edge e E £" is at most 1. Since 
we have selected q to be an even integer, from the integrality of flow, we can assume that the flow Ft 
is l/(?-integral (to do so, set the capacity of every edge e ^ E' to be q/2, the capacity of every edge 
e G E' to q, add a source s, and connect it to every vertex v G T{H{t)) with an edge of capacity 1. 
The integral s-t flow of value q in this network, scaled down by factor q, gives the desired flow Ft). 
Every vertex v E T{H{t)) receives either or 1/q flow units in Ft. Let S't ^ T{H{t)) be the set of 
exactly q vertices that receive 1/q flow units in Ft. 

Let A' = UteT4 "^t' ^"^^ ^' ~ UteT^ '^t- Notice that \A'\ = q\TA\ =917^1 = \B'\, since the terminal 
sets T{H),T{H') are disjoint for H ^ H' . Since both sets A' and B' only contain terminals, and the 
graph G is a-well-linked for the terminals, there is a flow F" : A' -^2/0 B' in graph G. Scaling this 
flow down by factor q > 4/a, we obtain a flow F* , where every vertex v £ A' sends 1/q flow units, 
every vertex v £ B' receives 1/q flow units, and the total edge congestion is at most 1/2. Let flow F' 
be the concatenation of [JteTA ^* ^ '^tsT^ Then every vertex v £ Ta sends one flow 

unit in F' , every vertex v E Tg receives one flow unit, the flow on every edge e ^ E' is at most 1, and 
the flow on every edge e E -E" is at most 1.5. We assume w.l.o.g. that flow F' is half-integral. That is, 
we can decompose F' into flow-paths, each of which carries exactly 1/2 flow units. Notice that every 
vertex is an endpoint of either or 2 such flow-paths. 

We now transform the flow F' into a flow F : Ta '^1 Tb- This part is similar to the proof of Lemma 
3.10 in |CKS04b] . The transformation replaces every flow-path in F' by a corresponding simple flow- 
path, by removing all cycles on the path, if such exist. Let F be the resulting flow. It is now enough to 
show that every edge e £ E' carries at most one flow unit. Let e E -E" be any cut edge, and assume for 
contradiction that more than one flow unit is sent via e. Let X, Y be the two connected components 
of G \ {e}. Since the amount of flow on e is bounded by 1.5 and we assume that all flow-paths are 
simple and carry exactly 1/2 flow units each, there are three flow-paths that use edge e, and each such 
path connects a vertex of X to a vertex of Y. 

For every vertex v £ X, let ni{v) be the total number of flow-paths originating or terminating at v, 
and let ni = X]t;ex '^i(^)- Since ni{v) £ {0,2} for each v, ni is an even integer. Let n2{v) be the 
total number of flow-paths originating of terminating at v, whose other endpoint also belongs to X, 
and let n2 = X^^gx '^sl^^)- Since every flow-path contained in X contributes 2 to 712, n2 is also an 
even integer. However, since there are exactly three flow-paths that start at X and terminate at Y , 
ni —712 = 3, which is impossible. □ 
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Proof of Theorem [s] 



The proof roughly follows the arguments of |CKS04b] . and consists of three steps. In the first step, we 
define an initial clustering of the vertices of G and select one representative terminal from each cluster. 
If we could claim at this point that each selected representative is a pseudo-center of its cluster, then 



we would be done by Claim 12 However, this is not necessarily the case. To overcome this difficulty, 
we use a procedure suggested by [CKS04bj called tagging^ where each cluster C "tags" some other 
cluster C", such that the representative of C is a pseudo-center of the merged cluster C U C". The 
tagging procedure is performed in Step 2. Finally, in Step 3, we merge some pairs of clusters and 
select one pseudo-center for each such merged cluster. 



Step 1: Initial Clustering and Representative Selection We use Theorem 14 to find an initial 
clustering % of the vertices of G. Next, we select a representative for each cluster H £ Ti, using the 
following lemma. 

Lemma 2 There is an efficient randomized algorithm that w.h.p. finds, for each 1 < j < r, a subset 
Tj Tj of terminals, such that \Tj\ > ^{a\Tj\/r), the sets ^Tj^ o,re mutually disjoint, and for each 

cluster Hen, T{H) n (ULi 75') = 1- 



Proof: We start by selecting, for each 1 < i < r, at most one representative t £ T{H) D Tj for every 
cluster H, and we will later ensure that at most one representative is chosen for each cluster overall. 

Fix some 1 < j < r. We construct an initial set Tj ^ Tj of representatives, as follows. Start with 
Tj While Tj ^ 9, select any terminal t £ Tj and add it to Tj ■ Remove from Tj all terminals that 
belong to T{H[t)), and continue to the next iteration. Let T- be the final subset of terminals. Then 
\Tj \ > since for each terminal added to Tj, at most Aq terminals are removed from Tj. Moreover, 
for each cluster H £ T-L, at most one terminal from T{H) belongs to T-. 

Consider now some cluster H £%, and let 1^(11) be a multi-set, containing, for each 1 < j < r, the 
unique terminal in T{H) n TJ', if it exists (if some terminal t £ T{H) belongs to several sets then 
it appears several times in 1Z{H)). Since TZ{H) contains at most one representative from each set 
Tj , \TZ{H)\ < r. Cluster H then selects one terminal t £ TZ{H) uniformly at random, and terminal t 
becomes the representative of the cluster H. 

For each 1 < j < r, let Tj ^ Tj be the subset of terminals that serve as representatives of clusters in 

Ti. Then the sets \Tj( are mutually disjoint, and each cluster H £% has exactly one terminal in 
L J j=i 

T{H) n (^Uj=i T?') • It now only remains to show that for each 1 < j < r, \Tj\ > 0,{a\Tj\/r). 

Fix some 1 < j < r, and let fij denote the expectation of \Tj\. Then fij > \Tj\/r > |7^|/(4gr) = 

Q{a\Tj\/r) = il(rlogr). From the Chernoff bound, Pr \Tj\ < fJ.j/2 < e~'^^^^'^ < 1/r^'. Using the 

union bound, the probability that \Tj\ > 0,{a\Tj\/r) for all 1 < j < r is at least 1/r^'^^. By repeating 
the procedure a polynomial number of times, we can ensure that this algorithm succeeds w.h.p. □ 

For each cluster H £%, the unique terminal in T{H)r\ (^Uj=i 7?) is called the representative of H. So 

far we have constructed the sets Tj, for 1 < j < r of terminals that have all desired properties, except 
that we are not guaranteed that G is 1-well linked for Uj=i 75' • could guarantee that each terminal 
t £ [j^j^^Tj is a pseudo-center of its cluster H{t), then this property would follow from Claim |l2j 
However, we cannot guarantee this property at this stage. We overcome this difficulty in the next 
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step, using the tagging technique of |CKS04b] . For each 1 < j < r, we denote kj = \Tj\ = ^{a\Tj\/r). 

Step 2: Tagging The idea of the tagging technique is that we merge some pairs of clusters in T-L, 
and select one representative for each such merged cluster (from among the two representatives of 
the merged clusters), in a way that ensures that these representatives are pseudo-centers of their new 
clusters. We use the following result of |CKS04b] . 

Theorem 15 (Lemma 3.13 in \CKS04^ ) Let H £ T-L be any cluster, and let s G V^H) be any vertex 
that is not a pseudo-center of H . Then there is an edge e = {x,y) with x G V{H), y V{H), such 
that for any path P, connecting H to another cluster H' £ %, where the first edge of P is e, and P 
is edge-disjoint from H , vertex s is a pseudo-center of the cluster H L) P L) H' . Moreover, the edge 
e = {x,y) can be found efficiently. 

For every terminal t G U^=i Tj, iit is not a pseudo-center of H{t), let et = {xt, yt) be the corresponding 



edge guaranteed by Theorem 15, and let H'{t) be the cluster of T-i containing yt (such a cluster exists 



since every vertex of G belongs to one of the clusters in %, from Theorem 14). We then say that the 
terminal t and the cluster H{t) tag the cluster H'{t). 



we are 



Intuitively, we can now create a new cluster, by merging H{t) and H'{t). From Theorem 15 
guaranteed that t is a pseudo-center of the new cluster. However, since we require that the clusters 
are disjoint and every cluster only has one representative, the representative of the cluster H'{t) will 
need to be discarded. While in general we only expect to discard a constant fraction of the terminals, 
it is possible that many of these terminals belong to one of the sets TJ-, and we will end up discarding 
too many terminals from one such set. Another difficulty is when many terminals tag the same cluster 
H' . In this case, following the approach of |CKS04b] . we can partition all such terminals into pairs, 
and connect their corresponding clusters by edge-disjoint paths that are contained in H' . For each 
such new merged cluster, we can select one of the two representative terminals that is guaranteed to 



be a pseudo-center by Theorem 15 Again, we will need to discard some clusters (such as the cluster 
H' in this case), and we need to ensure that we do not discard too many representatives from any set 

%■ 

We start by building a collection H* ^ H oi clusters, such that for any pair {H, H') of clusters, 
where H tags H' , at most one of these clusters may belong to H* . We also ensure that among the 
representatives of the clusters in enough terminals belong to each set Tj. 

For convenience, for each 1 < j < r, we say that all terminals in 7j are of color j. For each terminal 
t G Tj, the color of the corresponding cluster H{t) is also j. Therefore, we have at least kj clusters 
of each color j. Following the arguments of |CKS04b] . we build a directed graph D, whose vertex set 
corresponds to the clusters in %, that is, V{D) = {vh \ H G Ti}, and there is an edge {vh,vh') iff H 
tags H' . We say that a vertex vh has color j iff the color of H is j. As in |CKS04b] . we need to find a 
large independent set I in graph D. However, we also need to ensure that I contains rL{kj/r) vertices 
of each color j. 

Claim 13 There is an efficient algorithm that finds an independent set T in graph D, containing at 
least Vt{kj/r) vertices of each color 1 < j < r. 

Proof: For each 1 < j < r, let k'j denote the number of vertices of color j in graph D. We will assume 
w.l.o.g. that k2 = k'^ = ■ ■ ■ = k[ = Q,{ak2/r), and k[ = Q{aki/r), by discarding, if necessary, some 
vertices from the graph D. We consider two cases. 

Assume first that k[ < rk2. Let /3 = rk'2/k'i. Notice that the out-degree of every vertex in D is at 
most 1. We say that a vertex vh of color 1 is good, iff the total number of edges adjacent to vh, 
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whose other endpoint's color is different from 1, is at most 8/3. Since the total number of vertices 

in the graph D is at most r/c2 + < 2rk'2 = 2^^^, the total number of edges is also at most 2f3k[. 
Therefore, at least half the vertices of color 1 are good. Let D' be the sub-graph of D induced by 
the good color-1 vertices. Since the out-degree of every vertex in D' is at most 1, and D' contains at 
least k[/2 vertices, we can efficiently find an independent set Ii of size exactly [^J in graph D' , by 
iteratively greedily selecting a vertex with minimum total degree. The selected vertex v is then added 
to Ii, and it is deleted from D' , together with all its neighbors. Let Ii be the resulting independent 
set of size [^J , and let V be the subset of the vertices of colors 2, . . . , r, whose neighbors belong to 
Xi. Then \V'\ < [^J • 8/3 < ^. 

Let D" be the sub-graph of D induced by the vertices of colors 2, . . . , r, that do not belong to V' . 
Then for each 2 < j < r, D" contains at least /C2/2 vertices of color j. Next we construct, for each 

k' 

2 < j < r, a set Ij of [g^J vertices of color j, such that Uj=i^j is an independent set in graph D. 
We start with Ij = for all 2 < j < r, and the perform iterations. In each iteration, wc select a 
minimum-degree vertex v from D" , and add it to the set Ij, where j is the color of v. Wc then delete 
V and all its neighbors from D" . If, at any point of the algorithm execution, the size of the set Ij, 

for any 2 < j < r, reaches [|^J, then we delete all the remaining vertices of color j from D", and 
continue. Notice that since the out-degree of every vertex is at most 1 in any induced sub-graph of 
D" , we can always find a vertex whose degree is at most 2 in the current graph. Therefore, in each 
iteration, at most 3 vertices are deleted from D". 

It is easy to see that at the end of the algorithm, for each 2 < j < r, = [|^J. Indeed, assume 

otherwise, and assume that for some j* , set Ij* contains fewer than [|^J vertices upon the termination 
of the algorithm. But the original graph D" contained at least ^2/2 vertices of color j* , the number 

k' k' 

of iterations is bounded by (r — 1) • [g^J < and in each iteration at most 3 vertices of color j* are 
deleted from D" . Therefore, upon the termination of the algorithm, at least one vertex of color j* 
must be present in the graph, a contradiction. The output of the algorithm is Uj=i^j- 

fc' 

Consider now the second case, where k\ > rk2- Let = We say that a vertex of colors 2, . . . ,r 
is good if it has at most 8/3r neighbors of color 1. Since the total number of edges in the graph is 
bounded by k'^ + rk'2 < 2k'i = 2/3rA;2, for every color 2 < j < r, at least half the vertices of color j are 
good. We construct a sub-graph D' of graph D, induced by all good vertices of colors 2, . . . ,r. Since 
graph D' contains at least ^3/2 vertices of each color, using the same algorithm as before, we can find, 
for each 2 < j < r, a collection Xj of ^ good color-j vertices, such that the set X' = Uj=i is 
independent set in D'. Let V' be the subset of color-1 vertices that have neighbors in X'. Then, since 
all vertices in X' are good, \V'\ < r ■ ■ 8f3r = = Let V" be the subset of color-1 vertices 
that do not belong to V', and let D" be the subgraph of D induced by V" . Then D" contains at least 
k[/2 vertices, and using the same algorithm as before, we can find an independent set Xi of size at 
least k[/8 in graph D". We then output Uj=i^i! which is guaranteed to be an independent set. □ 

Let C H contain the set of clusters H whose corresponding vertex vh G X. For each 1 < j < r, let 
f* C fj denote the sets of terminals t where H{t) G H*. Then for each l<j<r, \ f*\ = n{a\Tj\/r^) 
w.h.p. 

Step 3: Merging the Clusters In this step we merge some pairs of clusters, and select a represen- 
tative for each new merged cluster, such that on the one hand, this representative is a pseudo-center 
of the merged cluster, and on the other hand, we obtain enough representatives from each set 7^* of 
terminals. 

We construct a new collection Q of merged clusters, and the final collection T' = Uy=i Tj represen- 
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tative terminals, where for each 1 < j < r, Tj QT* ■ We start with ^ = and T' = 0. 

We add to Q all clusters H G 71* that do not tag any cluster. Recall that in this case, the unique 

terminal t G T{H) n (^Uj=i "7^*^ is a pseudo-center of H. We add t to T' ■ 

Consider now some cluster H £ H, and let Q{H) C %* be the set of all clusters in %* that tag H. If 
= 1, then let H' G %* be the unique cluster that tags H. Let t G Uj=i "Tj* be the representative 
of H, that is, t G T{H). We then create a new merged cluster, H* = H \J H' L) et(//), which is added 



to Q. We set T{H*) = T{H) U T{H'). By Theorem 15 we are guaranteed that t is a pseudo-center 
oiH*. We add t to V. 

Assume now that |^(//)| > 1. We use the following lemma from jCKS04b] . 

Lemma 3 (Lemma 3.14 in fCKSOj^) Let T be a tree and A some even multi-set of vertices in V{T). 
Then we can efficiently find \A\/2 edge-disjoint paths in T, such that each vertex v £ A is the endpoint 
of exactly n„ of these paths, where v occurs times in A. 

Let Q{LL) = {Hi, . . . , Hp}, and for each 1 < i < p, let ti £ T{Hi) be the unique terminal that belongs 
to Uj=i "Tj ■ If P is odd, then we choose one of the clusters Hi, ... , Hp uniformly at random, and 
discard it from Ti* . Notice that p > 3 must hold in this case, so the probability that any cluster is 
discarded is at most 1/3. We assume from now on that p is even. Let A = {yti,yt2i ■ ■ ■ ^Utp}- We use 
Lemma [s] to find a collection V of edge-disjoint paths contained in H (by first building a spanning 
tree T of H), where every path connects a distinct pair yti,ytj of vertices. For each such path P gV, 
if P connects yj. to yj ., then we construct a new cluster H* = HiL) P L) Hj, and add it to ^. We set 



T{H*) = T{Hi) U T{Hj). Notice that from Theorem 15, both ti and tj are pseudo-centers of H* . We 



select one of these two terminals uniformly at random, and add it to T' . 

Consider the final set Q of clusters. From the construction, we are guaranteed that all these clusters are 
edge-disjoint, since the original clusters % were vertex-disjoint. Every terminal t G 7"' is guaranteed 
to be a pseudo-center of its cluster in ^. It is now enough to show that for each 1 < j < r, at least 
175*1/12 = n{a\Tj\/r'^) terminals from f* belong to V w.h.p. 

Fix some 1 < j < r, let t G T* be any terminal, and let H{t) G %* be the cluster that t represents. 
Recall that H[t) may be discarded from %* with probability at most 1/3 (if H{t) tags some other 
cluster H' , for which the parameter p is an odd number greater than 1), and additionally, if H{t) is 
merged with another cluster H' , then t is added to T' with probability at least ^. Using the standard 
Chernoff bound, the probability that more than a 2/3-fraction of clusters of color j are discarded from 

%* is at most l/poly(r), and similarly, the probability that fewer than terminals from Tj are 
added to T' is bounded by 1/ poly(r), since 175*1 > VL{a\Tj\/r'^) = Jl(logr). Using the union bound 
over all 1 < j < r, with probability at least ^, set T' contains at least |75*|/12 terminals from TJ-, 
for all 1 < J < r. By repeating this procedure a polynomial number of times, we can ensure that it 



succeeds with high probability. Applying Claim 12 to the set Q of clusters and the set T' of terminals, 
we conclude that G is 1- well- linked for T' . 

Proof of Theorem |4] 

The proof closely follows the proof of Theorem [3j except that now we are only given two subsets 71 , T2 
of terminals, and we need to ensure that the terminals in the set 7^' are chosen so that for every pair 
(s, t) G M, either both s, t belong to T{, or none of them. 



Step 1: Initial Clustering and Representative Selection As before, we use Theorem 14 to 



find an initial clustering T-L. We then select representatives for the clusters H £%, using the following 
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analogue of Lemma [2j 

Lemma 4 There is an efficient algorithm, that either routes a subset Ad' C 7W of n{aki) pairs on 
edge-disjoint paths in G, or finds two subsets Ti ^ 71 and T2 ^ T2, with \Ti\ = i^{aki), |72| = ^l{ak2), 
such that: 



• 71 n7i = 0, 

• for each cluster H eU, \T{H) n (Tl U 7i)| < 1, and 

• for each pair {s,t) £ Ai, either both s,t belong to Ti, or neither of them does. 



Proof: We start by selecting a subset T2 Q 72 of representatives from the set 72, exactly as in the 

iq 



proof of Lemma 2 We construct a collection 7^' C 72 of at least |^ terminals, such that for each 



cluster H £ Ti, at most one terminal of T{H) belongs to 72- 

The selection of representatives for 71 is performed slightly differently. We construct a new matching 
Ml C A^, that consists of the selected representatives. We start with Aii =0. While 7^ 0, 
let (s,t) E be any pair of terminals in A4. Add {s,t) to A^i, and remove from 71 all terminals 
contained in T{H{s)) U T{H{t)). Remove from M all pairs that are not contained in the current set 
71 of terminals. Let Aii be the final matching that the algorithm has constructed. It is easy to verify 
that \M.i\ > since for every pair that we add to A^i, we remove at most 8q pairs from A4. Let 

71' C 71 be the subset of terminals that participate in pairs in A^i. Then for every cluster H £ T-L, 
either \T{H)nfi\G {0, 1}, or |r(-ff) n 711 = 2. In the latter case, the two terminals s,t eT{H)nfi 
belong to A^ as a pair. 

Partition Aii into two subsets M\ and A^'/, where Ai'^ contains all pairs {s, t) where both s, t G T{H) 
for some cluster T{H), and Ad" contains all remaining pairs. If |A^x| > ||A^i|, then we have found a 
collection A4'i C A^ of i}{aki) pairs, where each pair {s,t) S Ai'i is associated with a distinct cluster 
H £ Ti, where s,t £ T{H). Since each cluster H £ 71 is connected, and the clusters are mutually 
edge-disjoint, we can route the pairs in Ai'i on edge-disjoint paths. Therefore, we assume from now 
on, that {Ai'd > |A^i|/2 > To simplify the notation, we denote Ai'i by Aii from now on. We 

denote by 71' the set of terminals participating in the pairs in A^i, so |71'| = Q{aki). 

Notice that now for every cluster H £ Ti, at most one terminal t £ T{H) belongs to 71', and at most 
one terminal t' £ T{H) belongs to 72, while it is possible that t = t' . If T{H) contains both a terminal 
of 71 and a terminal of 72, then we say that iJ is a mixed cluster. Let 7i' be the set of all mixed 
clusters. 

We partition H' into two subsets T-l[,T-L2, such that \Ti'i\, {7121 > \H'\/2 — l. Our partition ensures that 
if, for some pair (s,t) £ A^i, there are two clusters H,H' £ %' with s £ T{H) and t £ T{H'), then 
either both H, H' £ T-L'^, or both H, H' £ 7^2- The partition is performed by using the following greedy 
procedure. Start with 'H[,7i2 = 0. While Ti' ^ 0, let H £%' he any cluster, and let s be the unique 
terminal in T{H) n 71. Let t be the terminal such that £ A^i, and let H' £ Ti he the cluster 

for which t £ T{H'). If H' ^ H' , then we add H to either T-L'^ or ?^2 " whichever currently contains 
fewer clusters. Otherwise, we add both H and H' to one of the two sets ^^'^,'^2, that contains fewer 
clusters. The ties are broken arbitrarily. Let {%'i,%'2) be the resulting partition of %' . 

For each cluster H £ T-L'i, we remove the unique terminal t £ T{H) n 72 from 72'- For each cluster 
H £ 'H'2, let t he the unique terminal in T{H) n T^', and let s £'T{ such that (s, t) £ Aii. We remove 
(s, t) from Ail, and we remove both s and t from 71'. Notice that since {Ti'il > \'H2 \ — 2, we only remove 
a constant fraction of the terminals from 71'. Therefore, at the end of this procedure, we obtain a 
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subset Ai Ai of pairs of terminals, with = il.{aki), a set 71 of terminals participating in the 
pairs in A4, and another set ^ Ti of terminals, with jTil > 0,{ak2), such that 71 H Ti = 0, and for 
every cluster H €z H, T{H) contains at most one terminal t G 7i U Ti- □ 

From now on, we denote by C 7W the collection of terminal pairs contained in 71. For each cluster 
H G Ti, we say that the unique terminal t E T{H) Pi (7i U Ti) is the representative of H. Notice that 
some clusters H G T-L may have no representative. 

Step 2: Tagging The tagging is performed exactly as before. Consider some cluster H G T-L, that 
has a representative t G T{H) n (71 U Ti). If t is not a pseudo-center of H, then we can find an 



edge et = {xt,yt) with xt G V{H),yt V{H), guaranteed by Theorem 15 Let H' G V. he a, cluster 



containing the vertex yt (again, such a cluster must exist since every vertex belongs to some cluster 



by Theorem 15). We say that cluster H and terminal t tag the cluster H' . 



We say that a pair (s, t) G 7W is good if H{s) tags H{t) or vice versa. Let Ai' C 7\4 be the set of all 
good pairs, and let M." = M. \M.' . If > \M.\/2, then we can route all pairs in Ai' via edge-disjoint 
paths, where each pair (s, t) G Ai' is routed inside the connected component H(s) U H{t) U e^, if H{s) 
tagged H{t), or inside the component H{s) U H{t) U et if t tagged s. It is easy to see that all resulting 
paths are edge-disjoint. Therefore, we assume from now on that \Ai"\ > \A4\/2. To simplify notation, 
we will refer to A^" as 7W, and we discard from 7i terminals that do not belong to the new set Ai. 

We say that the terminals of 71 are red and the terminals of Ti are blue. If H G Ti has a representative 
terminal in 71, then we say that H is a red cluster; if it has a blue representative, then it is a blue 
cluster. Notice that it is possible that H has no representative, in which case it has no color. 

As before, we would like to find a subset H* ^ % of clusters, such that whenever H tags H\ only 
one of these two clusters belongs to %* . However, we have an additional restriction: for every pair 
(s, t) G Ai,we would like to ensure that either both H{s) and H{t) belong to %* , or none of them. 

In order to achieve this, we construct a directed graph D. The set of vertices of D consists of two 
subsets: the set of vertices representing the terminals in 72, V2 = |ft | t G 72 1, and the set of vertices 

representing the pairs of terminals in A^, Vi = ^Vg^t \ {s,t) G Ai^. 

We say that a vertex vt G V\ represents the cluster -ff(t), and a vertex Vs± S V2 represents the clusters 
ff(s) and -ff(t). We let V(I)) = V1UV2, and we add an edge {u, u') to D iff at least one of the clusters 
represented by u tags a cluster represented by u' . We denote |Vi| = ki and IV2I = /c2. Notice that 



ki = ^{aki) and k2 = Vt{ak2). We use the following variation of Claim 13 



Claim 14 There is an independent setX in D, containing at least [fci/24j vertices ofVi and at least 
[/c2 / 24J vertices of V2 ■ 



Proof: For simplicity, we treat D as a general directed graph, where the out-degree of every vertex is 
at most 2. Therefore, we can assume w.l.o.g. that ki < k2, and we denote (3 = k2/k\ We say that a 
vertex v gVi \s good iff it is adjacent to at most 8/3 vertices of V2. Since the total number of vertices 
in the graph is bounded by 2^fei, the number of edges is bounded by 4/3fei, and so at least half the 
vertices of Vi are good. 

Let D' be the sub-graph of D induced by the good vertices in Vi . Since the out-degree of every vertex 
in D' is at most 2, and > fci/2, we can find an independent set Xi in D' of size exactly [A;i/24J , 

using the standard greedy algorithm as before. 

Let V C V2 be the subset of vertices adjacent to the vertices of Xi. Since all vertices in Xi are good, 
\V'\ < 8/3 [5! J < ^. Let D" be the sub-graph of D induced by the vertices of V2\V' . Since the 
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out-degree of every vertex in D" is at most 2, and |y(-D")| > IV2I/2, we can find an independent set 
X2 of size [^2/24] in D" . We output Xi UX2 as our final solution. □ 

Let M* C M denote the set of pairs (s, t) with Vg^t G X, let 7^* C 71 be the set of all terminals 
participating in pairs in M* , and let T2 be the set of terminals t with vt £ X. Finally, let ^l* = 
{H{t) I t e 7^* U 72"}- Then for every cluster H eH*, T{H) has exactly one representative in T1UT2, 
and for any pair H, H' of clusters, where H tags H' , only one of the two clusters may belong to T-L* . 
Notice that we are also guaranteed that \Ti\ = $l(A;i) = Q{aki), and |72*| = ^{k2) = ^}{ak2). 

Step 3: Merging the Clusters In this step, we create the final set G of clusters, by merging 
some pairs of clusters in 71*. For each cluster H £ Q, we first select up to two terminals t,t' £ 
T{H) n {T* U72*) that we call potential pseudo-centers for H. In the end, at most one such terminal 
will be selected for every cluster. 

li H £ %* is a cluster that does not tag any other cluster, then we add H to Q. Let t E T{H) be 
the unique representative of H in Ti UT2 ■ Then we say that t is the potential pseudo-center of H. 



Notice that from Theorem 15 t is indeed a pseudo-center of H. 

Consider now some cluster H £ T-L, and let Q{H) C %* be the set of clusters H' that tag H. Assume 
that Q{H) = {Hi, . . . , Hp}. For each 1 < i < p, let tj G T{Hi) be the representative of Hi in 
7^* U 7^*. If p = 1, then we create one new cluster H* = H U Hi U {et^} and add it to G. We set 
T{H*) = T{Hi) U T{H), and we let ti be its potential pseudo-center. Notice that ti is indeed a 
pseudo-center of the new cluster, from Theorem |15[ 

Assume now that p > 1, and it is even. As before, we use Lemma [3] to find a collection V of edge- 
disjoint paths contained in H, where every path connects a distinct pair yti,yt of vertices. For each 
such path P £ V, if P connects yt- to yt^, then we construct a new cluster H* = Hi U P U Hj, and 



add it to g. We set T{H*) = T{Hi) U T{Hj). Notice that from Theorem [15} both ti and tj are 
pseudo-centers of the new cluster. We say that ti and tj are potential pseudo-centers for the new 
cluster, and we will later select at most one of them to represent this cluster. 

Finally, assume that p is odd. If the number of blue clusters in Q{H) is at least two, then we discard 
one arbitrary blue cluster from Q{H), and continue as in the case where p is even. Otherwise, Q{H) 
must contain at least two red clusters. In this case we say that H is an odd cluster. 

For each such odd cluster H, we would like to select one red cluster H' £ Q{H) to discard. However, 
once such a cluster is discarded, if s G T{H') is the representative of H' in 7]* U 7^*, and t is the 
terminal with (s, t) £ M* , we will need to discard the terminal t as well. That is, t will not be able to 
serve as a pseudo-center of its cluster H{t). In particular, if H{t) £ Q{H') of some other odd cluster 
H' , we need to ensure that H{t) is the cluster that we discard from Q{H') in this case. 

We proceed in two stages. In the first stage, while there is a pair (s, t) £ M* , such that H{s) £ G{H), 
H{t) £ Q{H'), with H 7^ H\ and both H and H' are odd clusters, we discard H{s) from Q{H), H{t) 
from Q{H'), and {s,t) from M* . The clusters H and H' now stop being odd, as |^(//)|, are 
now even. We process these two clusters as in the case where p is even. When no such pair (s, t) £ Ai* 
remains, the first stage finishes and the second stage begins. We process the remaining odd clusters 
one-by-one. For each such cluster H, there must be at least one red cluster H' £ G{H), such that, if 
t £ T{H') is the representative of H' in 7^*, and s is the terminal with {s,t) £ M* , then H{s) does 
not belong to the set Q{H") of any odd cluster H" . We then discard {s,t) from M.* , and discard 
H' from Q{H). If s is a potential pseudo-center for any cluster in Q, then it stops being a potential 
pseudo-center for that cluster. 

Let M** C M* be the set of all surviving pairs. We need the following simple claim. 
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Claim 15 \M**\ > |M*|/4. 

Proof: Consider the first stage of the algorithm, where wc have removed a cohection of pairs {s,t), 
where H{s) G Q{H), H{t) £ Q{H'), for odd clusters H and H' . For each such odd cluster H* , we have 
removed at most one such pair. However, since Q{H*) contained at least two red clusters, there is 
another pair {s' ,t') G M* , such that H{t') G Q{H*), and {s',t') was not removed from M* during the 
first step. It is easy to verify that at most half the pairs in Ai* were removed during the first stage. 
Let Ai' be the collection of pairs that survive after the first stage. 

In the second stage, for each remaining odd cluster H, we select one red cluster H' G Q{H), and 
delete the pair (s, t) with H(s) = H' from Ai'. However, since H still remains an odd cluster, we are 
guaranteed that there is another pair {s',t') G M', such that H{s') G Q{H), and H{t') does not belong 
to any current odd cluster. (Otherwise we could have continued the first phase for another step). We 
can therefore charge the pair (s, t) to the pair (s' , t'), that will not be removed during the second step. 
So overall, at most half the pairs are removed from A4' during the second stage. □ 

We conclude that |A^**| = il{aki). Let Ti* be the set of all red terminals participating in the pairs 
in M** , and let 7^** be the set of all blue terminals that serve as potential pseudo-centers for clusters 
in ^. It is easy to see that |7^**| > |7^*|/2 = fl{ak2), since every blue cluster discarded from H* can 
be charged to another blue cluster that remains in H*. 

To summarize, we have defined a collection Ti* of edge-disjoint clusters, a set Ai** C of Q{aki) 
pairs of terminals, a subset T{* C 71 of terminals that participate in the pairs in Ai**, and a subset 
T2* C 7^ of n{ak2) terminals, with T^* n 7^** = 0. Moreover, every terminal t e T{* U T2* is a 
potential pseudo-center for one of the clusters in Q, and every cluster H e Q now has either 0, 1, or 2 
potential pseudo-centers. The sets {'T{H)}jj^g of terminals are mutually disjoint. 

Notice that it is possible that for some clusters H £ Q, we have two potential pseudo-centers s,t 
for H, which form a pair in Ai** . Let A4\* C Ai** be the subset of all such pairs of terminals. In 
other words, {s,t) G Ail* both s and t are potential pseudo-centers of the same cluster H G Q. If 
|A^**| > |A^**|/2, then every pair in Ml* can be routed inside its own cluster, via edge-disjoint paths. 
In this case we terminate the algorithm and return this collection of paths. From now on wc assume 
that this is not the case. To simplify notation, we denote by A^** the set of pairs in A^** \ A^^*, and 
by 7^** the set of terminals participating in pairs in A^**. 

Step 4: Selecting the Pseudo-Centers In this step we select a subset At' C A^** of terminal 
pairs, a set T{ C Ti* of terminals participating in pairs in Ai' , and a subset 7^' C T2*, such that for 
each cluster H e Q, at most one potential pseudo-center belongs to 7^ U 7^'. 

Our first step is to ensure that every cluster in Q contains at most one red potential pseudo-center. 
In order to do so, we start with Ai' = 0. While Ai** ^ 0, we select an arbitrary pair {s,t) G Ai**, 
remove it from Ai** and add it to Ai' . Assume that s is a potential pseudo-center of H, and f is a 
potential pseudo-center of H' . We also remove from Ai** any other pair {s',t'), where either s' or 
t' are potential pseudo-centers of H or H'. Notice that for each pair added to Ai', at most three 
pairs are deleted from Ai**. Therefore, \Ai'\ > |A1**|/3 = Cl{aki). Let T{ be the set of all terminals 
participating in pairs in Ai**. 

Next, we ensure that every cluster in Q contains at most one blue potential pseudo-center. In order to 
do so, we start with 7^' = 0. For every cluster H G Q that contains two blue potential pseudo-center, we 
select one of these two blue terminals arbitrarily and add it to T-^- Notice that |72 1 > |72**|/2 = Q.{ak2). 

Finally, we need to take care of clusters H G Q that contain one red and one blue potential pseudo- 
centers. We call such clusters mixed clusters, and we take care of them like in Step 1. Let ^' C ^ be 
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the set of all mixed clusters. We partition Q' into two subsets G'i,G2, such that |, |^2l — l^'l/^ — 1- 
As before, we ensure that if, for some pair (s, t) £ M' , there are two clusters H, H' G G' , where s is a 
potential pseudo-center of H, and t is a potential pseudo-center of H' , then either both H,H' E G[, 
or both H, H' £ The partition is performed exactly as in Step 1. 

For each cluster H G G[, we remove its blue potential pseudo-center from 7^'. For each cluster H £ G'2, 
let t be its red potential pseudo-center, and let s G T{ such that {s,t) G Ai' . We remove (s,t) from 
M' , and we remove both s and t from T{- Since \Q'i\ > |^2l ~ 2, we only remove a constant fraction of 
the terminals from T^'. Therefore, at most a constant fraction of the pairs is removed from Ai' , and 
at most a constant fraction of terminals is removed from 72 • 

Let Ai','T{,72 be the resulting subsets of terminal pairs and terminals. Then \7i \ = ^l{aki), \T2 \ = 
Q{ak2)- For every cluster H G G, at most one potential pseudo-center for H belongs to T{uT2- Using 



Claim 12 graph G is 1-well-linked for the set T' = T{ U T2 of terminals. 
B Proof of Theorem [l] 

In this section we complete the proof of Theorem [T| using Theorem |6j 

As in the previous work on the EDPwC problem [RZ10|,IXndl01IChul2| . we follow the outline of |CKS05] . 
by first partitioning the graph G into a collection of sub-instances, where each sub-instance is well- 
linked for the corresponding set of the terminals. We then solve each such sub-instance separately, 
by embedding an expander into it. The specific type of embedding that we use is similar to the one 
proposed in |Chul2| . and is summarized in the following definition. 

Definition 7. Let Ai' C 7W be a subset of the demand pairs, and let T' = T{Ai') be the set 
of terminals participating in the pairs in Ai' . Let H be any expander whose vertex set is T'- An 
embedding of H into G maps every vertex t G V{H) into a connected component Ct G with t G Gt, 
and every edge e = (t, t') G E{H) into a path Pg in graph G, connecting some vertex v G Ct to some 
vertex v' G Cf. Given an edge e' G E{G), the load on edge e' is the total number of components 
{Ct}jg^/ and paths {Pe}e£E{H) which edge e' belongs. The congestion of the embedding is the 
maximum load on any edge e' G E{G). 

Our algorithm uses the cut-matching game of |KRV06j . summarized in Theorem [sj to embed an 
expander into the graph G. To simplify notation, we say that an A^-vertex (multi)-graph X is a good 
expander, iff N is even, the expansion of X is at least acMG(A^)> a-nd the degree of every vertex of X 
is exactly 7cmg(A). Once we embed a good expander into the graph G, we will need to route demand 
pairs across X via vertex-disjoint paths. 

There are many algorithms for routing on expanders, e.g. |LR991 [BFU94t |BFSU94[ |KR96| IFriOO] . that 
give different types of guarantees. We use the following theorem, due to Rao and Zhou [RZlOj (see 
also a proof in |Chul2j ). 

Theorem 16 (Theorem 7.1 in |RZ10j ) Let G = {V,E) be any n-vertex d-regular a- expander. As- 
sume further that n is even, and that the vertices of G are partitioned into n/2 disjoint demand pairs 

Ai = {(si,ti), . . . , (s„/2i*n/2)}- Then there is an efficient algorithm that routes Q. ^ ^^^^.^2 ^ of the 
demand pairs on vertex- disjoint paths in G. 

Corollary 1 Let X be a good N-vertex expander, and assume that the vertices of X are partitioned 
into N/2 disjoint demand pairs Ai = {(si,ti), . . . , (sAr/2)iAr/2)}- Then there is an efficient algorithm 

that routes ( , ^'^''^^o'^^^ln ) = 0, ( , ^ I of the demand pairs on vertex- disjoint paths in X . 
V log ^-70^10 (^)/ Vlog^/ J f 
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The fohowing theorem allows us to construct a good expander H and embed it into G with congestion 
at most 2, given a good crossbar (S* , M* ,t*) in graph G. 

Theorem 17 Assume that we are given an undirected graph G = {V,E) with vertex degrees at most 
4, and a set M of k demand pairs, defined over a set T of terminals. Assume further that the degree 
of every terminal is 1, every terminal participates in exactly one pair in Ai, and G is 1-well-linked 
for T ■ Then there is an efficient randomized algorithm, that with high probability outputs one of the 
following: 

• Either a subset M' C of k/ poly log /c demand pairs and the routing of the pairs in Ai' with 
congestion at most 2 in G; 

• Or a good expander H together with the embedding of H into G with congestion at most 2. 

Proof: We apply Theorem [g] to graph G. If the outcome is a subset j\4' M. oik/ poly log k demand 
pairs and their routing in M' with congestion at most 2, then we return this routing, and terminate 
the algorithm. We assume from now on that the algorithm in Theorem |6] produces a good crossbar 
{S*,M*,T*). 

The good expander H is constructed as follows. The set of vertices of H, V{H) = T{M*)- The 
embedding of each vertex t G V{H) is the unique tree T G r*, to which terminal t belongs. In order to 
compute the set of edges of H and their embedding into G, we perform the cut-matching game, using 
Theorem [5] Recall that this game consists of 7cmg iterations, where in iteration j, for 1 < j < 7cmg) 
the cut player computes a partition (Aj, Bj) of V{H), and the matching player responds with a perfect 
matching between the vertices of Aj and the vertices of Bj. Given the partition {Aj,Bj) of V{H), 
computed by the cut player, we find a corresponding partition {A'j,Bj) of T*, as follows. For each 
ti G V{H), if ti G Aj, then we add Cij to A'j, and otherwise we add it to I3'j. Since the set Sj is 

1-well-linked for F*, we can find a collection Q* : A'j Bj of edge-disjoint paths contained in S* . 
These paths define a matching M,- between the edges of A'- and the edges of B'-, which in turn defines a 
matching between the terminals of Aj and the terminals of Bj . We then add the edges of the matching 
to the graph H. For each such edge e G M,-, its embedding into G is the corresponding path Q G Q*. 
From Theorem [5| after 7cmg iterations, we obtain a good expander H, together with its embedding 
into G. It is easy to see that the embedding causes congestion at most 2 in graph G. Indeed, consider 
any edge e £ E. If edge e belongs to any sub-graph G[Sj], for Sj G S* , then it belongs to at most one 
tree in r*, and to at most one path in {Pg' I e' G E{H)}. If e does not belong to any such sub-graph, 
then it is contained in at most two trees in r*, and it is not contained in any path in {Pg/ | e' G E{H)}. 
Therefore, the total congestion of this embedding is at most 2. □ 

We are now ready to complete the proof of Theorem [T] Our starting point is similar to that used in 
previous work on EDP |CKS04b[ [CK505l iRZTOl lAndlOi [CEil2] . We use the standard multicommodity 
flow LP-relaxation for the EDPwC problem to partition our graph into several disjoint sub-graphs, that 
are well-linked for their respective sets of terminals. In the standard LP-relaxation for EDPwC, we 
have an indicator variable Xi for each 1 < i < k, for whether the pair (sj, ti) is routed. Let Vi be the 
set of all paths connecting Si to ti in G. The LP relaxation is defined as follows. 
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While this LP has exponentially many variables, it can be efficiently solved using standard techniques, 
e.g. by using an equivalent polynomial-size LP formulation. We denote by OPT the value of the 
optimal solution to the LP. Clearly, the value of the optimal solution to the EDPwC problem instance 
with congestion 2 is at most OPT. 

Notice that if we replace Constraint Q with the following constraint: 



then we obtain the standard multicommodity LP relaxation for the EDP problem itself, where no 
congestion is allowed. Let OPT' denote the optimal solution value of this new LP. Then clearly 
OPT' < OPT < 20PT'. 

The next theorem follows from the work of Chekuri, Khanna and Shepherd |CKS04bl ICKSOSj . and 
we provide a short proof sketch for completeness. 

Theorem 18 Suppose we are given a graph G = {V, E) and a set Ai of k source-sink pairs in G. 
Then we can efficiently partition G into a collection Gi, . . . ,Gi of vertex- disjoint induced sub-graphs, 
and compute, for each 1 < i < i, a collection M.i M. of source-sink pairs contained in Gi, such 
that = ri(OPT/ log^ k), and moreover, if for each 1 < i < £, Ti denotes the set of terminals 

participating in pairs in Mi, then % ^ V{Gi), and Gi is 1-well-linked for %■ 

Proof: We need the following definition. 

Definition 8. Given a graph G = {y,E)^ and a subset T V oi vertices called terminals, we say 
that T is fiow-well-linked in G, iff any matching M on T can be fractionally routed with congestion 
at most 2 in G. 

The next theorem follows from the work of Chekuri, Khanna and Shepherd jCKS04bl ICKSOSj . and 
its proof also appears in |Chul2| . 

Theorem 19 Suppose we are given a graph G = {V, E) and a set M of k source-sink pairs in G. 
Then we can efficiently partition G into a collection Gi, . . . ,G£ of vertex- disjoint induced sub-graphs, 
and compute, for each 1 < i < i, a collection A4i ^ M of source- sink pairs contained in Gi, such 
that = ^i(OPT/ log^ k), and moreover, if for each I < i < i, Ti denotes the set of terminals 

participating in pairs in Mi, then % ^ V{Gi), and Gi is flow-well-linked for %■ 

Fix some 1 < i < I. From the min-cut max-flow theorem, since Gi is flow-well-linked for 71, then it 
must be 1/2- well- linked for 71. We apply the grouping technique from Theorem [4] to graph Gi, where 
the first set of terminals is 7i, together with the matching Mi defined over 7i, and the second set of 
terminals is 0. Notice that a = 1/2 in this case. If the outcome is a subset M'i C Mi of r2(|A^i|) 



f{P) < 1 yeeE. 
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pairs, together with their routing on edge-disjoint paths, then graph Gi is 1-well-hnked for T{M[)- 
Otherwise, we obtain a subset M.'^ C M.^ of r2(|A^i|) pairs, where Gi is 1-well-Hnked for T{M.'j)- In 
any case, for each graph Gi, we now obtain a subset A4[ C Ji4i of the demand pairs, such that Gi is 
1-well-hnked for T{M'i), and J2i=i = ^{OPT/ log^ k). □ 

We now proceed to solve the problem on each one of the graphs Gi separately. For each i, let ki = \M'i\- 
Notice that in order to complete the proof of Theorem [T| it is now enough to show that there is an 
efficient randomized algorithm that for each i < i < i, routes poly log /cj) pairs in A4[ with 

congestion at most 2 in graph Gi w.h.p. 



Fix some 1 < i < i. We apply Theorem 17 to graph Gi and the set M'^ of the demand pairs. If the 
output of Theorem 17 is a collection A4'- C of fej/ poly log /cj demand pairs, together with their 
routing with congestion at most 2 in Gj, then we terminate the algorithm and return this routing for 
graph Gi. Otherwise, we obtain a collection Ai'- C A^'. of ki / poly log ki demand pairs, a good expander 
H whose vertex set is V{H) = T{M'l), and the embedding of H into the graph Gi with congestion 

2. Using Theorem 16 



/ \jv["\ \ 

we can compute a collection M.i C M.'l of ( i a^'/iu ) = ^{ki/ poly log fcj 



^ (log I A^^' 1)4 

demand pairs, and a routing V of the pairs in Mi in the expander H via vertex-disjoint paths. We 
now construct a routing of the pairs in A4i in the graph Gi, with congestion at most 2. 

In order to construct the routing, consider any path P G V, and assume that it connects a pair 
{s,t) £ Aii. We transform the path P into a path Q, connecting s to t in graph Gi, as follows. 
Assume that P = (to = s,ti, . . . ,tr = t), and let ej be the edge connecting tj to tj+i, for < j < r. 
For each such edge ej, let Pej be the path via which ej is embedded into G. Let Vj be the first vertex 
on Pg. , and let Uj+i be the last vertex on P^. . Then vj belongs to the connected component Ctj , and 
Uj+i belongs to the connected component Ct^+i • 

For each 1 < j < r, let Qj be any path connecting uj to vj inside the connected component C{tj). Let 
Qq be any path connecting to to vq inside Ct^, and let Qr be any path connecting Ur to tr inside Ct^. 
Path Q is then obtained by concatenating Qo, -feo; Qi-, ■ ■ ■ ■, Pr-i, Qr- Notice that path Q connects the 
original pair (s, t) of terminals to each other. 

Let Q be the final set of paths, obtained after processing all paths P £ V. From the above discussion, 
every pair (s, t) £ Aii has a path connecting s to t in Q. In order to bound the congestion caused by 
Q, recall that the paths in V are vertex-disjoint. Therefore, the paths in Q traverse every component 
Ct for t € V{H) at most once, and use every path Pe foi" e £ E{H) at most once. Since the congestion 
of the embedding of H is bounded by 2, the paths in Q cause congestion at most 2 overall. 

To conclude, for each 1 < i < we have computed a collection Ai* C M.i of ki/ poly log fej demand 
pairs, and a routing Qi of the pairs in M* with congestion at most 2 in Gi. Since X]j=i — 
r2(0PT/log^ k), overall Ui=i Qi ^ (poly log A;)-approximate solution for the EDPwC instance (G, A^) 
with congestion 2. 

C Proof of Claim |4 



Let H = G'\T. Fix some 1 < j < 7. Let £i{j) be the bad event that J2vex, ^uiv) > ■ [l + ^ 
In order to bound the probability of we define, for each vertex v G V{H), a random variable 

Xy, whose value is "^^j"^ if f G Xj and otherwise. Notice that x^ G [0, 1], and the random variables 
{^■u}.ui^y{^H) pairwise independent. Let B = Ylivi^v{H)^v- Then the expectation of B, /ii = 
Yliv<^V{H) '^^ki'^ ~ Using the standard Chernoff' bound (see e.g. Theorem 1.1 in |DP09| ). 
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10m 

< 

7 



, 2m 1 

Pr [£^{j)] = Pr [5 > (1 + 1/7) fii] < e-^i/(37 ) = g < 

67 

since m > k/6 and h = 192^3 i^g^ - 

For each terminal t £ T, let ej be the unique edge adjacent to t in graph G' , and let be its other 
endpoint. Let U = {ut \ t G T}. For each vertex u £ U, let w{u) be the number of terminals t, 
such that u = Uf. Notice that w{u) < ki must hold. We say that a bad event S2{j) happens iff 
"^u&unXj wlu) > I • ^1 + In order to bound the probability of the event £2ij), we define, for each 
u € U, a random variable yu, whose value is w{u)/ki iff n G Xj, and it is otherwise. Notice that 
Uu G [0, 1], and the variables yu are independent for all u £ U. Let Y = Ylm^uHu- The expectation 

of y is /X2 = and event £2{j) holds IS Y > ^ ■ [l + ^ > /U2 • (^1 + Using the standard 
Chernoff bound again, we get that: 

Pr [£2{3)] < 6-^2/(372) < g-fe/(3fci7^) < Jl 

67 

since ki = 192^3 log^ - Notice that if events £i{j),£2{j) do not hold, then: 

ioutG'(x,)i< Yl (1 + ^) + 

since m > k/6. 

Let <?3(j) be the bad event that \Ec'{Xj)\ < We next prove that Pr [iSaO')] ^ 67- ^^^^ 
two edges e, e' £ E{G' \ T) are independent iff they do not share any endpoints. Our first step is 
to compute a partition Ui, . . . ,Ur of the set E{G' \ T) of edges, where r < 2A;i, such that for each 
1 < i < r, \Ui\ > and all edges in set Ui are mutually independent. In order to compute such a 
partition, we construct an auxiliary graph whose vertex set is {ve \ e G E{H)}, and there is an edge 
{ve,Vf>f) iff e and e' are not independent. Since the maximum vertex degree in G' is at most fci, the 
maximum vertex degree in Z is bounded by 2ki — 2. Using the Hajnal-Szemeredi Theorem |HS70| . 
we can find a partition Vi, . . . ,Vr of the vertices of Z into r < 2ki subsets, where each subset Vi is an 
independent set, and \Vi\ > ^'^^f^^ — 1 > The partition Vi, . . . ,Vr of the vertices of Z gives the 
desired partition Ui, . . . ,Ur of the edges of G' \ T. For each 1 < i < r , we say that the bad event £^ {j ) 
happens iff \Ui D E{Xj)\ < Notice that if £3{j) happens, then event £^{j) must happen for some 
1 < i < r. Fix some 1 < i < r. The expectation of H E{Xj)\ is = Since all edges in Ui are 
independent, we can use the standard Chernoff bound to bound the probability of £l{j), as follows: 

Pr = Pr [\U n E{Xj)\ < < e-'^''^ = e"^ 

Since \Ui\ > m > k/Q, ki = ^92^3 and 7 = e(log'^A;), this is bounded by We conclude 

that Pr [<?3(i)] < -^2fci7 ' ^^"^ using the union bound over all 1 < i < r, Pr [<?3(j)] < 

Using the union bound over all 1 < j < 7, with probability at least i, none of the events £i{j) , £2{j) , £i,{j) 
for 1 < j < 7 happen, and so for each 1 < j < 7, | oviiG'{Xj)\ < and \EG'{Xj)\ > ^ must hold. 
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D Integrality Gap of the Multi-Commodity Flow Relaxation for EDP 

Recall that in the multcommodity flow relaxation of the EDP problem, the goal is to maximize the 
total amount of flow sent between the demand pairs, with no congestion, with at most one flow unit 
being sent between any pair. The gap construction for this relaxation, due to Garg, Vazirani and 
Yannakakis |GVY9 3] starts from a sub-graph of the {k + 1) x (k + 1) grid depicted in Figure [2j Every 
degree-4 vertex in this graph is then replaced by the gadget depicted in the figure. This gadget ensures 
that whenever two paths traverse a degree-4 vertex of the grid, one vertically and one horizontally, 
then they must share an edge. 




S2 ■■■ 



Figure 2: The gap example for the multicommodity flow relaxation 

Notice that every pair (sj,tj) can send ^ flow unit from Si to ti along the path that contains the 
vertical segment of the grid incident to Si and the horizontal segment incident to U. Therefore, the 
value of the multicommidity flow relaxation is k/2 = Q{^/n). However, it is easy to see that the 
value of the optimal integral solution is 1. Indeed, for any path P connecting any pair (sj,tj), if we 
remove all edges of P from the graph, then all other demand pairs become disconnected. Therefore, 
the integrality gap of this example is Q{^/n). 

Notice also that in this example we can route k = ^}{^/n) demand pairs integrally with congestion 
2, while at most one demand pair can be routed with congestion 1. This shows a polynomial gap 
between routing with congestion 1 and routing with congestion 2 and higher. 

Figure [3] depicts a brick- wall graph. Obtaining sub-polynomial approximation algorithms for EDP on 
brick-wall graphs remains an interesting open problem. 



Figure 3: A brick- wall graph 
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E Table of Parameters 



m 


e{logk) 


Flow-cut gap in undirected graphs. 


7CMG 


e(log^ k) 


Parameter for the cut-matching game, from Theorem 5 


7 




JNumber of sets m the initial partition, l-c = {bi, . . . , o^j 


7 




Threshold for the number of leaves for Cases 1 and 2 




0(VlogA;) 


Approximation factor of the algorithm of |ARV09j for Spars- 
est Cut. 


a 


1 _nr 1 ^ 




Parameter for well-linked decomposition. 


Owl 


a/aARv(fc) = ^ (iog9.5fe) 




Well-linkedness parameter. 


r 


87CMG 


Number of sets in TZ' 


ki 


19273 log 7 "Vlog24fc 


k 

log 


Og fc 


) 


Threshold for large clusters. 


k' 


fcl 
67^ 


\ poly log fc J 




Parameter in the construction of graph Z in Step 1. 


k2 


n ( ''^'^■^A = n( k 

\ yi.j 1 \ polylo 






Number of paths for every edge in T in Step 1. 


ki 






Number of demand pairs routed to S, S' in Step 2. 


ke 




3gfc 


) 


Number of trees in r* is at least k^/l. 
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